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OM nucleic - nucleic search, using sw model 
Run on: 



updates/sec 
Title: 

Perfect score: 

Sequence: 

2766 

Scoring table: 



Searched: 



January 22, 2005, 07:16:57 ; Search time 8599.76 Seconds 

(without alignments) 
11720.367 Million cell 



US-10-054-680-1 
2766 

1 atggcgtggttaaggttgca gctacatcaaggggttctaa 



IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

32822875 seqs, 18219865908 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



65645750 
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being 
printed, 

and is derived by analysis of the total score distribution. 
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Result Query 

No. Score Match Length DB ID 



Description 
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BC036783 
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1786.4 


64. 
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1788 
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AY401283 


AY401283 
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1741.4 


63. 
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1788 


9 


AY401284 


AY401284 



4 1624.4 58.7 2534 

muscu 

5 1546.4 55.9 1788 

muscu 

6 1258.2 45.5 4374 

muscu 

7 1208.8 43.7 2922 

sapi 

8 1169.8 42.3 2922 

muscu 

9 1151.4 41.6 2516 

sapi 

10 978.4 35.4 2881 
trogl 

11 971 35.1 1589 
human ful 

12 948.4 34.3 3573 
muscu 

13 899.2 32.5 938 
BX374548 

14 874.8 31.6 2515 
muscu 

15 813.8 29.4 887 
603178823 

16 768.2 27.8 939 
BX347210 

c 17 743.4 26.9 791 
603175911 

18 739.8 26.7 792 
603175911 

19 737.8 26.7 916 
BX368185 

20 708.4 25.6 894 
BX390204 

21 698.2 25.2 2472 
trogl 

22 673.2 24.3 775 
M-FI0- 

23 657 23.8 665 
E-EJ1- 

24 605.4 21.9 704 
M-HD0- 

25 598.6 21.6 944 
BX368184 

c 26 579.2 20.9 753 
muscu 

27 574.6 20.8 752 
M-GH0- 

28 560.2 20.3 676 
BB280958 

c 29 553.6 20.0 1062 
Tetraodon 

30 516 18.7 527 
602805921 

31 503.8 18.2 588 
M-FY0- 

32 483.2 17.5 851 
BX325851 

33 480 17.4 1055 
603138368 



3 AK044636 

9 AY401285 

3 AK035163 

9 AY398961 

9 AY398963 

9 AY408693 

9 AY398962 

3 CNSLT1IBJ 

3 AK048160 
5 BX374548 
9 AY408695 

4 BI913344 

5 BX347210 
4 BI523145 

4 BI522813 

5 BX368185 
5 BX390204 
9 AY408694 
5 BQ770745 
5 BQ189572 
7 CF729293 
5 BX368184 
9 AG462634 
7 CF532853 
2 BB280958 
9 CNS04DXR 

4 BG910325 
7 CF533347 

5 BX325851 
5 BU116565 



AK044636 Mus 
AY401285 Mus 
AK035163 Mus 
AY398961 Homo 
AY398963 Mus 
AY408693 Homo 
AY398962 Pan 
BX248763 
AK048160 Mus 
BX374548 
AY408695 Mus 
BI913344 
BX347210 
BI523145 
BI522813 
BX368185 
BX390204 
AY408694 Pan 
BQ770745 UI- 
BQ189572 UI- 
CF729293 UI- 
BX368184 
AG462634 Mus 
CF532853 UI- 
BB280958 
AL286344 
BG910325 
CF533347 UI- 
BX325851 
BU116565 
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ALIGNMENTS 



BC036783 



3186 bp 



mRNA 



linear HTC 19- 



member 3, mRNA (cDNA clone IMAGE: 5732743) , with apparent 



RESULT 1 
BC036783 
LOCUS 
NOV-2003 

DEFINITION Homo sapiens solute carrier family 8 ( sodium- calcium 
exchanger) , 

retained 

intron . 
ACCESSION BC036783 

VERSION BC036783.1 GI:23331089 

KEYWORDS HTC. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 3186) 

AUTHORS Strausberg, R. L. , Feingold, E.A. , Grouse, L.H., Derge,J.G., 
Klausner, R. D. , Collins , F. S . , Wagner, L., Shenmen, C .M. , 
Schuler, G. D. , 

Altschul, S. F. , Zeeberg, B. , Buetow, K.H. , Schaef er, C . F. , 

Bhat,N.K. , 

Hopkins, R. F. , Jordan, H., Moore, T., Max,S.I., Wang, J., 

Hsieh, F. , 

Diatchenko, L. , Marusina,K. , Farmer, A. A. , Rubin, G.M. , 

Hong, L . , 

Stapleton,M. , Soares,M.B., Bonaldo, M. F. , Casavant, T . L. , 
Scheetz, T . E. , Browns tein,M. J. , Usdin, T. B. , Toshiyuki, S . , 
Carninci, P . , Prange, C. , Raha, S . S . , Loquellano, N . A. , 

Peters, G. J. , 



Abramson, R. D . , Mullahy, S . J. , Bosak, S .A. , McEwan, P. J. , 
McKernan, K. J. , Malek, J .A. , Gunaratne, P . H. , Richards, S . , 
Worley,K.C, Hale,S., Garcia, A.M., Gay,L.J., Hulyk,S.W., 
Villalon, D.K. , Muzny, D.M. , Sodergren, E . J. , Lu,X., 

Fahey,J., Helton, E., Ketteman,M. , Madan,A. , Rodrigues , S . , 
Sanchez, A., Whiting, M. , Madan,A., Young,A.C, Shevchenko, Y. 
Bouf f ard, G. G. , Blakesley, R. W. , Touchman, J. W. , Green, E.D., 
Dickson, M.C. , Rodriguez, A. C. , Grimwood, J. , Schmutz, J. , 

Butterfield, Y.S. , Krzywinski,M. I . , Skalska,U. , Smailus , D. E. 
Schnerch,A., Schein,J.E., Jones, S.J. and Marra,M.A. 
Generation and initial analysis of more than 15,000 full- 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 
22388257 
12477932 

2 (bases 1 to 3186) 
Strausberg, R. 
Direct Submission 

Submitted (23-AUG-2002) National Institutes of Health, 
Gene Collection (MGC) , Cancer Genomics Office, National 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892- 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 
Contact: MGC help desk 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: Invitrogen 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: National Institutes of Health Intramural 

Sequencing Center (NISC) , 

Gaithersburg, Maryland; 

Web site: http://www.nisc.nih.gov/ 

Contact : nis c_mgc@nhgri . nih . gov 

Akhter,N. , Ayele, K. , Becks trom-Sternberg, S .M. , Benjamin, B. , 
Blakesley, R.W. , Bouf f ard, G . G . , Breen,K. , Brinkley,C. , 

Brooks, S . , 

Dietrich, N. L. , Granite, S., Guan,X., Gupta, J., Haghighi,P., 
Hansen, N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., 

Legaspi, R. , 

Maduro,Q.L. , Masiello,C. , Maskeri,B. , 
Mastrian, S . D. ,McCloskey, J.C. , 

McDowell, J. , Pearson, R. , Stantripop, S . , Thomas, P. J. , 
Touchman, J. W. , 

Tsurgeon, C. , Vogt, J. L. , Walker, M. A. , Wetherby, K. D . , 

Wiggins, L. , 

Young, A., Zhang, L.-H. and Green, E.D. 

Clone distribution: MGC clone distribution information can 

be found 

through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 

Series: IRAK Plate: 79 Row: j Column: 21 

This clone was selected for full length sequencing because 

it 



Gibbs,R.A. , 



Myers, R.M. 

TITLE 
length 

JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

Mammalian 

Cancer 

2590, 

REMARK 
COMMENT 



passed the following selection criteria: matched mRNA gi 

17865803 

This clone has the following problem: retained intron. 
FEATURES Location/Qualifiers 
source 1. .3186 

/organism="Homo sapiens" 

/mol_t ype= "mRNA" 

/db_xref="taxon: 9606" 

/ cl one= " IMAGE : 5 7 32 7 4 3 " 

/ tissue_type="Brain / hippocampus " 

/ clone_lib="NIH_MGC_12 4 " 

/lab_host="DH10B" 

/note="Vector: pCMV-SPORT6" 

ORIGIN 

Query Match 95.0%; Score 2627.4; DB 3; Length 3186; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2628; Conservative 0; Mismatches 1; Indels 0; 
Gaps 0; 

Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 558 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 617 
Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 618 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 677 
Qy 121 

ACAGGGC AGAACAAT GAGT CCT GTT CAGGGT CAT C GGACT GCAAGGAGGGT GT CAT CCT G 180 

II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 678 

ACAGGGCAGAACAAT GAGTCCT GTT CAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 737 
Qy 181 

CCAAT CT GGT ACC CGGAGAACC CT T CC CTT GGGGACAAGATT GC CAGGGT CATTGT CTAT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 738 

C CAAT CT GGT ACC CGGAGAAC C CT T CCCTT GGGGACAAGATTGCCAGGGT CATT GT CT AG 7 97 
Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 798 

TTT GT GGC C CTGAT AT AC ATGT T C CTT GGGGTGT C CAT C ATT GCT GACC GCT T CAT GGCA 857 
Qy 301 

T CT ATT GAAGT CATCACCT CT CAAGAGAGGGAGGT GACAAT TAAGAAACC CAAT GGAGAA 360 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 858 



T CT ATT GAAGTCAT CAC CT CT CAAGAGAGGGAGGT GACAATTAAGAAAC CCAAT GGAGAA 917 
Qy 361 

ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I 
Db 918 

ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 977 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I II I I I I I I I I I I I I I I I I 
Db 978 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 1037 
Qy 481 

ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 540 

I I I I I I I I I I I I I I I I I I I l"l I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1038 

ATT GCT GGT GAT CT GGGAC CTT CT AC CATT GT AGGGAGT GCAGCCTT CAACAT GTT CAT C 1097 
Qy 541 

AT CAT T GGC AT CT GT GT CT AC GT GAT C CCAGAC GGAGAGACT CGCAAGAT CAAGC AT CT A 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1098 

AT CAT T GGCAT CT GTGT CT AC GT GAT C CCAGACGGAGAGACT CGCAAGAT CAAGCATCT A 1157 
Qy 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 1158 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 1217 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1218 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 1277 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I I I I I II I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1278 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 1337 
Qy 781 

AT GCACAAAAAGT AC C GCACAGACAAACACCGAGGAATT AT C AT AGAGACAGAGGGTGAC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1338 

AT GCACAAAAAGT AC C GCACAGACAAAC ACCGAGGAAT TAT CAT AGAGACAGAGGGT GAC 1397 
Qy 841 

CACCCTAAGGGCATTGAGAT GGAT GGGAAAAT GATGAATT CCCATTTT CTAGATGGGAAC 900 



Db 1398 

CACC CTAAGGGCATTGAGAT GGATGGGAAAAT GAT GAAT T C C C ATT TT CT AGAT GGGAAC 1457 
Qy 901 

CTGGTGCCCCTGGAAGGGAAGGAAGT GGAT GAGT CCCGCAGAGAGAT GATCCGGATT CTC 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 1458 

CTGGTGCCCCT GGAAGGGAAGGAAGT GGATGAGT CCCGCAGAGAGAT GATCCGGATT CTC 1517 
Qy 961 

AAGGAT CTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCT GGTGGAGATGGCCAAT 1020 

I I I I I I 1 I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I. I I I I I I I 
Db 1518 

AAGGAT CT GAAGCAAAAACACC CAGAGAAGGACTT AGAT CAGCT GGTGGAGAT GGCCAAT 1577 
Qy 1021 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

I II I II I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1578 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1637 
Qy 1081 

AT GAT GACT GGTGCAGGCAATAT CCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1638 

AT GAT GACT GGT GCAGGCAAT AT C CT GAAGAAAC AT GCAGCAGAACAAGCCAAGAAGGCC 1697 
Qy 1141 

T C C AGCAT GAGCGAGGT GCACAC CGATGAGCCT GAGGACTT T ATT T CCAAGGTCTT CT TT 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1698 

TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 1757 
Qy 1201 

GACCCATGTT CTTACCAGT GCCT GGAGAACTGT GGGGCT GTACTCCTGACAGTGGTGAGG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1758 

GACC C AT GTT CT T ACCAGT GCCT GGAGAACT GT GGGGCT GT ACTCCT GACAGT GGTGAGG 1817 
Qy 1261 

AAAGGGGGAGAC AT GT CAAAGACCAT GT AT GT GGACT ACAAAACAGAGGAT GGT T CT GCC 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1818 

AAAGGGGGAGACAT GT CAAAGAC CAT GT AT GT GGACT ACAAAACAGAGGAT GGTT CT GCC 1877 
Qy 1321 

AAT GCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCT GAAGCCAGGAGAGACC 1380 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1878 

AAT GCAGGGGCTGACT AT GAGTT CAC AGAGGGC AC GGT GGT T CT GAAGC CAGGAGAGAC C 1937 



Qy 1381 

CAGAAGGAGTT CT C CGT GGGCATAATT GAT GAC GACATTT T T GAGGAGGAT GAACACTTC 1440 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1938 

CAGAAGGAGTT CT C CGT GGGCATAATT GAT GAC GAC ATTTT T GAGGAGGAT GAACACTTC 1997 
Qy 1441 

T T T GTAAGGTT GAGCAAT GT CC GCAT AGAGGAGGAGCAGC C AGAGGAGGGGAT GC CT CCA 1500 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1998 

T TT GTAAGGT T GAGCAAT GT CC GCAT AGAGGAGGAGCAGCC AGAGGAGGGGAT GC CT CCA 2057 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 2058 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 2117 
Qy 1561 

GTT AC C AT CTT GGAT GATGACCAT GCAGGCAT CTT CACTT TT GAAT GT GAT ACT ATT CAT 162 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2118 

GTTACCAT CTTGGATGATGACCATGCAGGCAT CTT CACTTTTGAAT GT GAT ACT ATT CAT 2177 
Qy 1621 

GT CAGT GAGAGT AT T GGT GT TAT GGAGGT CAAGGT T CT GC GGACAT CAG GT GCC C GGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2178 

GT CAGT GAGAGTATT GGT GTTAT GGAGGT CAAGGT TCTGCGGACATCAGGT GCC C GGGGT 2237 
Qy 1681 

AC AGT C AT CGT CC CCTT TAGGAC AGT AGAAGGGAC AGCCAAGGGT GGC GGT GAGGACTTT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2238 

ACAGT CAT C GT C CC CTTT AGGACAGT AGAAGGGACAGC CAAGGGT GGCGGT GAGGACTTT 2297 
Qy 1741 

GAAG AC AC AT AT GG GGAGT T GGAAT T C AAGAAT GAT GAAACT GT GAAAAC C AT AAGGGTT 1800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2298 

GAAGACACATATGGGGAGTT GGAATT CAAGAAT GAT GAAACT GT GAAAACCATAAGGGTT 2357 
Qy 1801 

AAAAT AGT AG AT G AGGAGGAAT AC GAAAG GCAAGAGAAT T T CT T CAT TGCCCTTGGT GAA 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2358 

AAAAT AGT AGAT GAGGAGGAAT ACGAAAGGCAAGAGAATTT CTT C ATT GCC CTT GGT GAA 2417 
Qy 1861 

C C GAAAT GGAT GGAAC GT GGAAT AT C AGAT GT GACAGACAGGAAGCT GACT ATGGAAGAA 1920 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II 1 1 1 1 1 1 1 1 1 

Db 2418 

CCGAAAT GGAT GGAACGT GGAAT AT CAGAT GT GACAGACAGGAAGCT GACTAT GGAAGAA 2477 
Qy 1921 

GAGGAGGCCAAGAGGAT AGCAGAGAT GGGAAAGC CAGT ATT GGGTGAACAC CCCAAACTA 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I . 
Db 2478 

GAGGAGGC CAAGAGGAT AGCAGAGAT GGGAAAGC CAGT ATT GGGT GAACAC CC CAAACTA 2537 
Qy 1981 

GAAGT CAT CATT GAAGAGT C CT AT GAGTT CAAGACT ACGGT GGACAAACT GAT CAAGAAG 2040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2538 

GAAGT CAT CATT GAAGAGT CCT AT GAGTTCAAGACT AC GGT GGACAAACT GAT CAAGAAG 2597 
Qy 2041 

ACAAACCT GGC CTT GGTT GT GGGGAC C CAT T C CT GGAGGGAC C AGTT CAT GGAGGCCAT C 2100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2598 

ACAAACCT GGCCTT GGT T GT GGGGACCC ATT CCT GGAGGGACCAGTT CAT GGAGGC CATC 2657 
Qy 2101 

ACCGT CAGT GC AGCAGGGGAT GAGGAT GAGGAT GAAT C C GGGGAGGAGAGGCT GC C CT CC 2160 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2658 

ACC GT CAGT GCAGC AGGGGAT GAGGAT GAGGAT GAAT CCGGGGAGGAGAGGCT GCCCT CC 2717 
Qy 2161 

TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 2220 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2718 

TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 2777 
Qy 2221 

CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 2280 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2778 

CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 2837 
Qy 2281 

ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 234 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2838 

ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 2897 
Qy 2341 

GAT T CAGT C ACAGCT GT T GTT TT C GTGGCAT TT GGCACCT CTGT C CCAGAT ACGTT T GC C 2400 



Db 2898 

GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 2957 



Qy 2401 

AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 2460 



I 1 I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 2958 

AGCAAAGCT GCT GC C CTC C AGGAT GTAT AT GCAGAC GC CT C C ATT GGCAAC GT GAC GGGC 3017 
Qy 2461 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 2520 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3018 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 3077 
Qy 2521 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2580 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 3078 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 3137 

Qy 2581 CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAA 2629 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3138 CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAA 3186 



RESULT 2 
AY401283 

LOCUS AY401283 1788 bp DNA linear GSS 15- 

DEC-2003 

DEFINITION Homo sapiens SLC8A3 gene, VIRTUAL TRANSCRIPT, partial 
sequence, 

genomic survey sequence, 
ACCESSION AY401283 

VERSION AY401283.1 GI: 39757272 

KEYWORDS GSS. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1788) 

AUTHORS Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., 
Kejariwal, A. , 

Todd, M. A., Tanenbaum, D.M. , Civello, D . R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
TITLE Inferring nonneutral evolution from human- chimp-mouse 

orthologous 

gene trios 

JOURNAL Science 302 (5652), 1960-1963 (2003) 
PUBMED 14671302 
REFERENCE 2 (bases 1 to 1788) 

AUTHORS Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., 
Kejariwal, A. , 

Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T . J. , Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
TITLE Direct Submission 

JOURNAL Submitted ( 16-NOV-2003) Celera Genomics, 45 West Gude Drive, 



Rockville, MD 20850, USA 
COMMENT This sequence was made by sequencing genomic exons and 

ordering 

them based on alignment. 
FEATURES Location/Qualifiers 
source 1. .1788 

/organism= ,f Homo sapiens" 
/ mo l_type=" genomic DNA" 
/db_xref="taxon: 9606" 
gene <1. .>1788 

/gene="SLC8A3 M 
/locus_tag="HCM0839" 

ORIGIN 

Query Match 64.6%; Score 1786; 4; DB 9; Length 1788; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1787; Conservative 0; Mismatches 1; Indels 0; 
Gaps 0; 

Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 
Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 12 0 

Qy 121 

ACAGGGCAGAACAAT GAGT C CT GT T CAGGGT CAT C GGACTGCAAGGAGGGTGT CAT C CT G 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 

ACAGGGCAGAACAAT GAGT C CT GT TCAGGGT CAT CGGACTGCAAGGAGGGTGT CAT C CT G 180 
Qy 181 

CCAAT CT GGTACCC GGAGAACC CTTC C CT T GGGGACAAGAT T GCCAGGGT CAT T GT CT AT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 181 

CCAAT CT GGTACCCGGAGAACC CT T C C CTT GGGGACAAGATT GC CAGGGT CAT T GT CT AT 240 
Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 
Qy 301 

T CT AT T GAAGT CAT CACCT CT CAAGAGAGGGAG GT GACAAT TAAGAAAC CCAAT GGAGAA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 

T CTATTGAAGT CATCACCTCTCAAGAGAGGGAGGT GACAAT TAAGAAAC CCAAT GGAGAA 360 



Qy 361 

AC CAGCACAACCACT ATT C GGGT CT GGAAT GAAACT GT CT C CAACCT GACCCTTAT GGCC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 

AC CAGC ACAACCACTATT C GGGT CT GGAAT GAAACT GT CT C CAACCT GAC CCT TAT GGC C 420 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I 
Db 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 
Qy 481 

AT T GCT GGT GAT CT GGGAC CT T CT AC CAT T GT AGGGAGT GCAGCCTT CAACAT GT T CAT C 540 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I i I I I I 
Db 481 

AT T GCT GGT GAT CT GGGAC CT T CT ACCAT T GT AGGGAGT GCAGC CTT CAACAT GTT CAT C 540 
Qy 541 

AT C ATT GGCAT CT GT GTCT AC GT GAT C C C AGACGGAGAGACTC GCAAGAT CAAGC ATCT A 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 

AT CAT T GGC AT CT GT GTCT ACGT GAT C C CAGAC GGAGAGACT C GCAAGAT CAAGCAT CT A 600 
Qy 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 

C GAGT CT TCT TCATC ACC GCT GCTT GGAGT AT CT TT GCCT ACAT CT GGCT CT AT AT GAT T 660 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I 
Db 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 
Qy 721 

T T T CCAGT GT GT GTC CTT CTGGCCT GGGT GGCAGATAAAC GACT GCT CTT CTACAAAT AC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 

TTTCCAGTGTGTGTCCTT CTGGCCT GGGT GGCAGATAAAC GACT GCT CTT CTACAAAT AC 780 
Qy 781 

AT GCACAAAAAGT AC C GC ACAGACAAAC AC C GAGGAATT AT CAT AGAGACAGAGGGT GAC 840 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 

AT GCACAAAAAGT ACC GCACAGACAAAC AC C GAGGAAT TAT CAT AGAGAC AGAGGGTGAC 840 
Qy 841 

CAC C CTAAGGGC ATT GAGAT GGAT GGGAAAAT GAT GAATT C C C ATTTT CT AGATGGGAAC 900 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 841 

C AC CCTAAGGGCATT GAGAT GGAT GGGAAAAT GAT GAATT C CC AT TT T CT AGAT GGGAAC 900 
Qy 901 

CT GGT GC CC CT GGAAGGGAAGGAAGT GGAT GAGT C CC GC AGAGAGAT GAT C CGGATT CT C 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 

CT GGT GC CC CT GGAAGGGAAGGAAGT GGAT GAGT C CCGCAGAGAGAT GAT CCGGATT CT C 960 
Qy 961 

AAGGAT CT GAAGCAAAAAC AC CC AGAGAAGGACTTAGATCAGCT GGT GGAGAT GGCCAAT 1020 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 

AAGGAT CT GAAGCAAAAAC AC CC AGAGAAGGACTTAGATCAGCT GGT GGAGAT GGCCAAT 1020 
Qy 1021 

T ACT AT GCT CTTT C CCAC CAACAGAAGAGCC GC GC CTT CT ACC GT AT C CAAGC CACT CGT 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 

TACT AT GCT CT TT CCCAC CAACAGAAGAGC CGC GC CTT CT ACC GT AT CCAAGC CACT C GT 1080 
Qy 1081 

AT GAT GACT GGT GCAGGCAAT AT C CT GAAGAAAC AT GC AGCAGAACAAGCCAAGAAGGCC 114 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1081 

AT GAT GACT GGT GCAGGCAAT AT CCT GAAGAAAC AT GCAGCAGAACAAGC CAAGAAGGCC 1140 
Qy H41 

T CCAGCAT GAGCGAGGT GCAC AC CGAT GAGC CT GAGGACTTTAT T T C CAAGGT CTT CTTT 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 

T CCAGCAT GAGCGAGGT GCACACC GAT GAGCCT GAGGACT T TAT T T CCAAGGT CTT CTTT 1200 
Qy 1201 

GAC CCAT GT T CTT AC CAGTGC CT GGAGAACT GT GGGGCT GT ACT C CT GACAGT GGT GAGG 1260 

I I \ I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1201 

GACC CAT GTT CT T ACCAGT GC CT GGAGAACT GT GGGGCTGT ACT C CT GACAGT GGT GAGG 1260 
Qy 1261 

AAAGGGGGAGAC AT GT CAAAGAC CAT GT AT GT GGACT ACAAAAC AGAGGAT GGT T CT GCC 1320 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 1261 

AAAGGGGGAGACAT GT CAAAGAC CAT GT AT GT GGACT ACAAAACAGAGGAT GGTTCT GC C 1320 
Qy 1321 

AATGC AGGGGCT GACTAT GAGT T C ACAGAGGGCAC GGT GGT T CT GAAGCCAGGAGAGAC C 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 

AAT GCAGGGGCT GACTAT GAGTT CAC AGAGGGCACGGT GGT T CT GAAGCC AGGAGAGACC 1380 



Qy 1381 

CAGAAGGAGTTCT CCGT GGGCATAAT T GAT GAC GACATTTTT GAGGAGGATGAACACT TC 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I 
Db 1381 

CAGAAGGAGTTCT CCGT GGGCATAAT T GAT GACGACATTTTTGAGGAGGAT GAAC ACTT C 1440 
Qy 1441 

TTT GTAAGGTTGAGCAAT GT CC GCAT AGAGGAGGAGCAGCCAGAGGAGGGGAT GC CTC CA 1500 

I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I 
Db 1441 

TTTGTAAGGTTGAGCAAT GTCC GCAT AGAGGAGGAGCAGCCAGAGGAGGGGAT GC CTC CA 1500 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 
Qy 1561 

GTT ACCAT CT T GGAT GAT GAC CAT GCAGGC AT CTT CACTT T T GAAT GT GAT ACT ATT CAT 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1561 

GTT ACCAT CT T GGAT GAT GAC CAT GCAGGC AT CTT CACTT T TGAAT GTGAT ACT ATT C AT 1620 
Qy 1621 

GT CAGT GAGAGT ATT GGT GT TAT GGAGGT CAAGGT T CT GC GGACAT CAGGTGC CCGGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 

GT CAGT GAGAGT ATT GGT GTT AT GGAGGT CAAGGTTCT GCGGACAT CAGGTGC CC GGGGT 1680 
Qy 1681 

ACAGTCAT C GT C CCCTTT AGGACAGT AGAAGGGACAGCCAAGGGT GGCGGT GAGGACT TT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 

ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 1740 

Qy 1741 GAAGACACAT AT GGGGAGTT GGAATT CAAGAAT GAT GAAACT GT GAAA 1788 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1741 GAAGACACAT AT GGGGAGTT GGAATT CAAGAAT GAT GAAACT GT GTAA 1788 



RESULT 3 
AY401284 
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DEC-2003 

DEFINITION Pan troglodytes SLC8A3 gene, VIRTUAL TRANSCRIPT, partial 
sequence, 

t\ genomic survey sequence . 
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VERSION AY401284.1 GI:39757273 
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REFERENCE 2 (bases 1 to 1788) 
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This sequence was made by sequencing genomic exons and 



COMMENT 
ordering 

FEATURES 

source 



gene 



them based on alignment. 

Location/Qualifiers 
1. .1788 

/organism="Pan troglodytes" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 9598" 
<1. .>1788 
/gene="SLC8A3" 
/locus_tag="HCM0839" 

ORIGIN 

Query Match 63.0%; Score 1741.4; DB 9; Length 1788; 

Best Local Similarity 97.8%; Pred. No. 0; 
Matches 1748; Conservative 0; Mismatches 
Gaps 0; 



40; Indels 



0; 



Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I II II I I I I I 
Db 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 
Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 
I I I I I I I I I I I 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 

TTTGTGCTCTTTCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

Qy 121 

AC AGGGCAGAACAAT GAGT CCT GT T C AGGGT C AT CGGACT GCAAGGAGGGTGT CAT C CT G 180 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 

AC AGGGCAGAACAAT GAGT CCT GT T CAG GGT CAT C GGACT GCAAGGAGGGTGT CAT CCT G 180 



Qy 181 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 

CCAAT CT GGT AC C C GGAGAAC CCT T C C CTT GGGGACAAGATT GC CAGGGT CAT T GT CT AT 240 
Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II 

Db 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCG 300 
Qy 301 

T CT ATT GAAGTC AT CACCT CT CAAGAGAGGGAGGT GACAATTAAGAAAC CCAAT GGAGAA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I 
Db 301 

TCTAT T GAAGT CAT CACCT CT CAAGAGAGGGAGGT GACGATTAAGAAAC CCAAT GGAGAA 360 
Qy 361 

ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 420 

I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 

AC C AGC ACAAC C ACT ATT C GAGT CTGGAAT GAAACT GT CT C CAAC CT GACC CTT AT GGC C 420 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 4 80 
Qy 481 

AT T GCT GGT GAT CT GGGAC CTTCT ACCATT GT AGGGAGT GCAGCCTT CAAC AT GT T CAT C 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 481 

AT T GCT GGT GAT CT GGGAC CT TCT ACCATT GTAGGGAGT GCAGCCTT CAAC AT GTT CAT C 540 
Qy 541 

AT CAT T GGCATCT GTGT CT ACGT GAT CCCAGACGGAGAGACT C GCAAGATCAAGCATCT A 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 541 

AT C ATT GGC AT CT GT GT CT AC GT GAT C CCAGAC GGAGAGACT CGCAAGAT CAAGCACCT A 600 
Qy 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I I I I I II I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 
Qy 781 

AT GCACAAAAAGTACCGCACAGACAAACACC GAGGAATT AT CAT AGAGACAGAGGGT GAC 840 

I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 

AT GCACAAAAAGTACCGCACAGATAAACACCGAGGAATTAT CAT AGAGACAGAGGGT GAC 840 
Qy 841 

CACCCTAAGGGCATT GAGATGGATGGGAAAATGAT GAATTCCCATTTTCTAGATGGGAAC 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 841 

CACCCTAAGGGCATTGAGAT GGATGGGAAAAT GAT GAATTCCCATTTTCTAGATGGGAAC 900 
Qy 901 

CTGGTGCCCCT GGAAGGGAAGGAAGTGGAT GAGTCCCGCAGAGAGAT GATCCGGATT CTC 960 

1. 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 

CT GGT GCCC CT GGAAGGGAAGGAAGTGGAT GAGT C C CGCAGAGAGAT GAT CCGGATT CTC 960 
Qy 961 

AAGGAT CT GAAGCAAAAACAC C C AGAGAAGGACTT AGATC AGCT GGT GGAGAT GGCCAAT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

III 

Db 961 

AAGGATCT GAAGCAAAAACAC CCAGAGAAGGACTTAGATCAGCT GGT GGAGAT GGCTAAT 1020 
Qy 1021 

T ACT ATGCT CT T T C C CAC CAACAGAAGAGCCGC GC CTT CT ACC GT AT C CAAGC CACT C GT 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I 
Db 1021 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGGATCCAAGCCACTCGT 1080 
Qy 1081 

AT GAT GACT GGT GCAGGCAAT AT C CT GAAGAAACAT GCAGCAGAACAAGCCAAGAAGGCC 1140 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1081 

ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 1140 
Qy 1141 

T C CAGCAT GAGCGAGGT GCACACC GAT GAGC CT GAGGACTTTAT TT CCAAGGT CTT CT TT 1200 

I I I I I I II I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 

T CCAGCAT GANNGAGGT GCAC AC CGAT GANNCT GAGGACT TT ATT T C CAAGGT CTT CTTT 1200 



Qy 1201 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 

I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 1201 

GACCCATGTTCTTACCAATGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 
Qy 1261 

AAAGGGGGAGACATGT CAAAGACCAT GTATGTGGACTACAAAACAGAGGATGGTTCTGCC 1320 

I I I I 1 I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 1261 

AAAGGGGGAGACATGT CAAAGACCAT GTATGTGGACTACAAAACAGAGGATGGTNNNGCC 1320 
Qy 1321 

AAT GCAGGGGCT GACT AT GAGT T CAC AGAGGGCAC GGT GGT TCT GAAGCC AGGAGAGAC C 1380 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 

AAT GCAGGGGCT GACT AT GAGTTCACAGAGGGCACGGT GGTTCTGAAGCCAGGAGAGACC 1380 
Qy 1381 

CAGAAGGAGTT CT CCGT GGGC ATAATT GAT GACGAC ATTTTTGAGGAGGATGAACACTT C 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
I II I I I I I I I I II I I I 
Db 1381 

CAGAAGGAGTTCT CC GT AGGC ATAAT TNNN GAC GAC ATTTNNNAGGAGGATGAAC ACTTN 1440 
Qy 1441 

TT T GTAAGGT TGAGCAAT GT C C GCAT AGAGGAGGAGCAGCC AGAGGAGGGGAT GCCTCC A 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1441 

NNNNNAAGGT TGAGCAAT GT CC GC AT AGAGGAGGAGCAGCCAGAGGAGGGGAT GNCTCCA 1500 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I II I 

Db 1501 

GCAATANNCAACAGTNNNNCCTTGCCTCGGGCTGTNNTAGCCTCCCCTTGTGTNGCCACA 1560 
Qy 1561 

GTT ACCAT CT TGGAT GAT GAC CAT GC AGGC AT CTT CACTTTT GAAT GTGAT ACTATTCAT 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1561 

GTT AC C ATCTT GGAT GAT GAC C AT GCAGGCAT CT T CACTTT TGAAT GT GAT ACT AT TCAT 1620 
Qy 1621 

GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 

GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 1680 
Qy 1681 

ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 1740 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 



Db 1681 

ACAGT C AT CGT CCC CTTT AGGAC AGT AGAAGGGAC AGC CAAGGGT GGCGGT GAGGACTT T 1740 



Qy 1741 GAAGACACATAT GGGGAGT T GGAATT CAAGAAT GATGAAACT GT GAAA 1788 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1741 GAAGACACATAT GGGGAGT T GGAATT CAAGAAT GAT GAAACT GT GTAA 1788 



RESULT 4 
AK044636 

LOCUS AK044636 2534 bp mRNA linear HTC 03- 

APR-2004 

DEFINITION Mus raus cuius adult retina cDNA, RIKEN full-length enriched 
library, 

clone :A930029A02 product : solute carrier family 8 
( sodium/ calcium 

exchanger), member 3, full insert sequence... 
ACCESSION AK044636 

VERSION AK044636.1 GI:26336660 

KEYWORDS HTC; CAP trapper. 
SOURCE Mus mus cuius (house mouse) 

ORGANISM Mus mus cuius 
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Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; 
Murinae; Mus. 
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Location/Qualifiers 
1. .2534 

/organism="Mus musculus" 
/mol_type="mRNA M 
/strain="C57BL/6J" 
/ db_x r e f = " FANTOM_DB : A9 3 0 0 2 9 AO 2 " 
/db_xref="taxon: 10090" 
/ cl one= " A9 3 0 0 2 9A0 2 " 
/tissue_type=" retina" 

/clone_lib="RIKEN full-length enriched mouse cDNA 

/ dev__s t age=" adul t " 
713. .>2533 

/note=" unnamed protein product; putative 
solute carrier family 8 (sodium/ calcium exchanger) , 

3 (MGDIMGI: 107976, GB | NM__0 80440, evidence: BLASTN, 

match=2394) " 
/codon_start=l 
/protein_id="BAC32013. 1" 
/db_xref="GI: 26336661" 

/translation= "MC P VQGRTMS P VRGHQT ARRVS FCQLW Y P EN P S LGD KI ARVI VY 

EVALI YMFLGVSI IADRFMAS I EVITSQEREVTI KKPNGETSTTTI RVWNETVSNLTL 

MALGS SAPEI LLSLI EVCGHGFI AGDLGPST I VGSAAFNMFI 1 1 GI CVYVI PDGETRK 

I KHLRVFFVTAAWS I FAYIWLYMI LAVFS PGWQVWEGLLTLFFFPVC VXLAWVADKR 

LLFYKYMHKKYRTDKHRGIIIETEGDHPKGIEMDGKMMNSHFLDGNFTPLEGKEVDES 

RREMI RI LKDLKQKHPEKDLDQLVEMAN YYALSHQQKS RAFYRI QATRMMTGAGNI LK 

KHAAEQAKKTSSMSEVliTDEPEDFASKVFFDPCSYQCLENCGAVLLTVVRKGGDISKT 

MYVDYKTEDGSANAGADYEFTEGTVVLKPGETQKEFSVGI I DDDI FEEDEHFFVRLSN 

WV^EEQLAEGMLPAILNSLPLPRAvTASPCVATWILDDDHAGIFTFECD 

IGVTVlEvT<VLRTSGARGTVIVPFRTV^GTAKGGGEDFEDAYGELEFKNDETVKTIRW 

VDEEEYERQENFFIALGEPKWMERGISGVRFFLNQSNSQER" 

ORIGIN 

Query Match 58.7%; Score 1624.4; DB 3; Length 2534; 

Best Local Similarity 91.7%; Pred. No. 0; 

Matches 1740; Conservative 0; Mismatches 156; Indels 2; 
Gaps 2; 

Qy 1 ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGG- 

CTGGTTAC 59 

I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I 

I I I I I I I I 



Boston, MA 
Please 



FEATURES 

source 



library" 
CDS 

member 
99%, 



Db 603 

ATGGCGTGGTTACGGCTGCACCCTCTCACCTCTGCCTTCCTCCATCTTGGGCCTGGTTAC 662 



Qy 60 

CTTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAG 119 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I III I I II I I I I I I 

I I 

Db 663 

TTTTGTGCTCTTCCTGAATTGTCTTCGAGCAGAGGCTGGTGACTCGGGGGATGTGCCCAG 722 



Qy 120 

CAC AGGGC AGAACAAT GAGT C CT GTT CAGGGT C AT CGGACT GCAAGGAGGGT GT CAT CCT 179 

I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 723 

T GCAGGGCAGAACAAT GAGT CCT GT T C GGGGT CAT CAGACT GCAAGGAGGGT GT C AT TTT 782 



Qy 180 GCCAA- 

T CT GGT ACCCGGAGAAC C CTT C C CTT GGGGACAAGAT T GC CAGGGT CAT T GT CT 238 

I I I I I I I I I I I I II 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 783 

GC CAACT CT GGT ATCCAGAGAAC CCT T CCCT T GGGGACAAGATT GC CAGGGT CAT T GT CT 842 



Qy 239 

ATTTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGG 298 
I I I I I I I I I I I I II I I I I II I I I I I I I 1 I I I I I I I I I II I I I II I I I I 

I I I I I I I 

Db 843 

ATTTTGTGGCCCTGATATACATGTTTCTTGGGGTGTCTATCATTGCTGACCGATTCATGG 902 



Qy 299 

CAT CTATTGAAGT CAT CAC CT CT CAAGAGAGGGAGGT GACAAT TAAGAAACC CAAT GGAG 358 
I I I I I I I I II I II I I I II II I I I I II I I I I I I I I I I II I II I I 

I I I I I I I I I I 

Db 903 

CATCTATTGAAGT CATTACTT CCCAAGAGAGGGAAGT GACCATCAAGAAGCCCAATGGAG 962 



Qy 359 

AAACCAGCACAACCACTATTCGGGT CT GGAAT GAAACT GT CT C CAAC CT GAC CCT T AT GG 418 

I I II I I I I I II I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 963 

AGACC AGCACAACTACAATT CGGGT AT GGAAT GAAACT GT CT C CAAT CT GAC C CT GAT GG 1022 



Qy 419 

C C CT GGGTT C CT CT GCTC CTGAGAT ACT C CT CT CTT TAATT GAGGT GT GT GGT CAT GGGT 478 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 1 1023 

CCCTGGGCTCTTCTGCTCCAGAGATTCTCCTGTCTTTAATTGAGGTGTGTGGTCACGGGT 1082 

Qy v'-v' 479 

TCATTGCtGGTGATCT GGGACCTT CTACCATTGTAGGGAGTGCAGCCTTCAACAT GTT CA 538 

I I I I I I I I I I I I I I I I I I I I I I MINIM II II 
I I I I I M I I I I I I M I I I I I M 
Db 1083 

T CATT GCT GGT GATCT GGGAC CAT CT AC CAT CGTT GGC AGT GC AGCCTT CAACAT GTT CA 1142 



Qy 539 



T CAT CATT GGCAT CT GTGT CT AC GT GAT C C C AGAC GGAGAGACT CGCAAGAT CAAGCAT C 598 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I i I I I I II I I I I I I I 

I 

Db 1143 

T CAT CATTGGC AT CT GT GT CT AT GT GAT C C CAGAT GGGGAGACT CGAAAGAT CAAGCACC 1202 
Qy 599 

TACGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGA 658 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I I I I I I 
Db 1203 

TGCGAGTCTTCTTCGTCACGGCTGCTTGGAGCATCTTCGCCTACATTTGGCTCTATATGA 12 62 
Qy 659 

TTCTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCT 718 
I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

I I I I I I I I I I 

Db 1263 

TCCTGGCAGTCTTCTCTCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTTACTCTCTTCT 1322 
Qy 719 

TCTTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAAT 778 

I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
I I I I I I I II I I I I I I I I I I 
Db 1323 

TCTTTCCCGTGTGTGTCCTGCTGGCTTGGGTGGCAGATAAGCGACTGCTCTTCTACAAAT 1382 
Qy 779 

AC AT GCACAAAAAGT ACCGC AC AGACAAACACCGAGGAAT TAT C AT AGAGACAGAGGGT G 838 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I I I I I I 
Db 1383 

ACAT GC ACAAAAAAT ACC GCACAGATAAAC ACC GAGGAATT AT CAT T GAGACAGAGGGT G 1442 
Qy 839 

AC CAC CCTAAGGGCAT T GAGAT GGAT GGGAAAAT GAT GAATT CC CATTTT CT AGAT GGGA 898 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I II I I I I I I I I I I 

Db 1443 

AC CACCCTAAGGGCATTGAGAT GGATGGGAAAAT GAT GAATT CT CACTT T CT AGAT GGGA 1502 
Qy 899 

ACCT GGT GCC CCT GGAAGGGAAGGAAGT GGAT GAGT CCC GCAGAGAGAT GAT C CGGATT C 958 
III II I I II I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I I II I I 
Db 1503 

ACTTT AC ACCTTT GGAAGGAAAGGAGGT AGAT GAAT CT CGCAGGGAAAT GAT CCGGAT T C 1562 
Qy 959 

TCAAGGAT CTGAAGCAAAAACACCCAGAGAAGGACTTAGAT CAGCTGGTGGAGATGGCCA 1018 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I II I I I I I I I I I I I I I 
Db 1563 

TAAAGGATCT GAAACAAAAACACCC AGAAAAGGACCT AGAT CAGCT GGTGGAGAT GGCCA 1622 
Qy 1019 

ATT ACT ATGCT CT TT C CCACCAACAGAAGAGC C GCGCCT T CT ACCGT AT CCAAGCCACT C 1078 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I 

I 

Db 1623 



AT T ACT ATGCT CT T TCCC AT CAAC AGAAGAGCCGTGCT T T CT ACCGCAT CCAAGC CACCC 1682 



Qy 1079 

GTATGATGACT GGTGCAGGCAATAT CCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGG 1138 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II 

I I I I I I I I I I I I 
Db 1683 

GGATGATGACTGGTGCGGGCAATATACTTAAGAAGCATGCAGCCGAGCAAGCCAAGAAGA 1742 
Qy 1139 

CCT CC AGCAT GAGC GAGGT GCACACC GATGAGC CT GAGGACTTT ATTT C CAAGGTCTT CT 1198 
I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I II 

I I I I I I I I I I 
Db 1743 

CCT CC AGCAT GAGCGAGGT GCAT AC C GAT GAGC CGGAGGACT TT GCCT CTAAGGT CTT CT 18 02 
Qy 1199 

TTGACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGA 1258 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I 

Db 1803 

TT GACC CAT GTT CT TAT C AGT GC CT GGAGAACT GT GGAGCT GTCCT CCT GAC CGT GGT GA 1 8 62 
Qy 1259 

GGAAAGGGGGAGACATGTCAAAGACCATGTATGT GGACTACAAAACAGAGGATGGTTCT G 1318 

I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 

I 

Db 1863 

GGAAAGGGGGAGAT AT AT C CAAGACCAT GT ACGT GGACT ACAAAACAGAGGACGGCTCC G 1922 
Qy 1319 

CCAAT GC AGGGGCT GACT AT GAGTT C AC AGAGGGC ACGGT GGTT CTGAAGC CAGGAGAGA 1378 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I II I I I I I I I I I I I I I 
Db 1923 

CCAATGCAGGGGCAGACTATGAGTTCACAGAGGGCACT GT GGTTCTGAAGCCAGGAGAGA 1982 
Qy 1379 

C C CAGAAGGAGT T CT C C GT GGGC ATAATT GAT GAC GACAT TTTT GAGGAGGAT GAACACT 1438 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1983 

CC CAGAAGGAGT T CT CT GT GGGCATC AT T GAT GAT GACATTT TT GAGGAGGAT GAACACT 2042 
Qy 1439 

TCTTTGTAAGGTTGAGCAAT GT CCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTC 1498 



Db 2043 

TCTTTGTGAGGCTGAGCAATGTCCGTGTAGAAGAGGAGCAGCTGGCGGAGGGGATGCTCC 2102 
Qy 1499 

CAGCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCA 1558 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I 
Db 2103 

CAGCAATACTCAATAGTCTTCCTTTGCCTCGGGCT GTCCT GGCCTCCCCTTGTGTGGCCA 2 1 62 
Qy 1559 

CAGTT AC CAT CTT GGAT GAT GAC CAT GC AGGCAT CT T C ACT T TT GAAT GT GAT ACTATT C 1618 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

1 1 1 1 

Db 2163 

CAGTAACCATCTT GGATGATGACCATGCAGGAATTTTCACTTTTGAAT GTGATACCATTC 2222 
Qy 1619 

ATGTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGG 1678 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I II I I I I I I 

I I I 

Db 2223 

AT GT CAGT GAAAGT ATT GGT GTT AT GGAAGT CAAGGT T TT GAGGACAT CAGGTGCCAGGG 2282 
Qy 1679 

GTACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACT 1738 
I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I I I I I I 

Db 2283 

GC AC AGT CAT CGT C CCT TT TAGGAC AGT AGAAGGGACAGCCAAGGGTGGT GGCGAGGACT 2342 
Qy 1739 

TTGAAGACACATAT GGGGAGTTGGAATT CAAGAATGAT GAAACTGTGAAAACCATAAGGG 1798 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I II I I I I 
Db 2343 

T T GAAGAT GCAT AT GGGGAGCT GGAGTT CAAGAAT GAT GAAACAGT GAAAACCATAAGGG 24 02 
Qy 1799 

TTAAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTG 1858 

I I I I I I I I I I I I I II I I I I I I I I I I I I 
I I I I I I I I I I I II I I I I I I I I I I I I II I I I I 
Db 2403 

TTAAAATAGT AGAT GAGGAGGAGTAC GAGAGGCAAGAGAAT T T CTT CAT TGC CCTTGGT G 2462 

Qy 1859 AACCGAAATGGAT GGAACGT GGAATAT CAGAT GT GACA 1896 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2463 AACCGAAATGGAT GGAACGT GGAATAT CAGGT GTGAGA 2500 



linear GSS 15 



RESULT 5 
AY401285 

LOCUS AY401285 1788 bp DNA 

DEC-2003 

DEFINITION Mus musculus SLC8A3 gene, VIRTUAL TRANSCRIPT, partial 
sequence, 

genomic survey sequence. 
ACCESSION AY401285 

VERSION AY401285.1 GI: 39757274 

KEYWORDS GSS. 

SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; 

Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; 
Murinae; Mus. 

REFERENCE 1 (bases 1 to 1788) 

AUTHORS Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., 
Kejariwal, A. , 

Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky,J.J. 
Adams, M.D. and Car gill, M. 



TITLE 
orthologous 

JOURNAL 
PUBMED 
REFERENCE 

AUTHORS 
Kejariwal, A 



TITLE 
JOURNAL 

COMMENT 
ordering 

FEATURES 

source 



gene 



ORIGIN 



Inferring nonneutral evolution from human- chimp-mouse 
gene trios 

Science 302 (5652), 1960-1963 (2003) 
14671302 

2 (bases 1 to 1788) 

Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., 
• / 

Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T . J. , Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
Direct Submission 

Submitted ( 16-NOV-2003) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence was made by sequencing genomic exons and 

them based on alignment. 

Location/Qualifiers 
1. .1788 

/organism="Mus musculus" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 10090" 
<1. .>1788 
/gene="SLC8A3" 
/locus_tag="HCM0839" 



Query Match 55.9%; Score 1546.4; DB 9; Length 1788; 

Best Local Similarity 91.6%; Pred. No. 0; 

Matches 1637; Conservative 0; Mismatches 151; Indels 0; 
Gaps 0 ; 

Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I I I I I I I I I II 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 1 

ATGGCGTGGTTACGGCTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACT 60 



Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I I I II I I I I I I I I I I I II I II I I I I I I I I I I I I I I III I I I II I I I I I 

I I 

Db 61 

TTTGTGCTCTTCCTGAATTGTCTTCGAGCAGAGGCTGGTGACTCGGGGGATGTGCCCAGT 120 



Qy 121 

ACAGGGCAGAACAAT GAGT C CT GTT CAGGGT CAT C GGACT GCAAGGAGGGT GT CAT C CT G 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 121 

GCAGGGCAGAACAAT GAGT CCT GTT CGGGGT CAT CAGACT GCAAGGAGGGT GT C AT TTTG 180 



Qy 181 

C CAAT CT GGT ACCC GGAGAACC CTT C C CTT GGGGACAAGAT T GCCAGGGT CATT GT CT AT 240 

I I I I I I I I I I I II 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 

C CAAT CTGGTATCCAGAGAAC CCT TCCCTTGGGGACAAGATT GCCAGGGT CATT GTCTAT 240 



Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 
I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I i I I I I I I I I I I I I I I I 

I I I I I II I I 

Db 241 

TTTGTGGCCCTGATATACATGTTTCTTGGGGTGTCTATCATTGCTGACCGATTCATGGCA 300 
Qy 301 

T CT AT TGAAGT CAT CACCT CT CAAGAGAGGGAGGT GACAATTAAGAAACC CAAT GGAGAA 360 
I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I II I I I I I 

I I I I I I I I I I I 

Db 301 

T CT ATTGAAGT CATT ACTT CCCAAGAGAGGGAAGT GACC AT CAAGAAGCC CAAT GGAGAG 360 
Qy , 361 

ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 420 

I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 361 

ACCAGCACAACT ACAATTCGGGTATGGAAT GAAACTGTCTCCAAT CT GACCCT GATGGCC 420 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

I I I I I I 

Db 421 

CTGGGCTCTTCTGCTCCAGAGATTCTCCTGTCTTTAATTGAGGTGTGTGGTCACGGGTTC 480 
Qy 481 

ATT GCTGGT GAT CT GGGACCTT CT AC CATT GT AGGGAGT GC AGCCT T CAACAT GTT CATC 540 

I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I 
I I I I I I II I I I I I I I I I I I I I I I I 
Db 481 

ATT GCTGGT GAT CT GGGACC AT CT AC CAT C GT T GGCAGT GCAGCCT T CAACAT GTT CAT C 540 
Qy 541 

ATCATTGGCAT CTGT GTCTACGTGATCCCAGACGGAGAGACT CGCAAGATCAAGCATCTA 600 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

I I 

Db 541 

ATCAT TGGCAT CT GT GTCT ATGT GAT C C CAGAT GGGGAGACT C GAAAGAT CAAGCACCT G 600 
Qy 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I I I I I I I 
Db 601 

CGAGTCTTCTTCGTCACGGCTGCTTGGAGCATCTTCGCCTACATTTGGCTCTATATGATC 660 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 
I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I 

I I I II I I I I I I I 
Db 661 

CTGGCAGTCTTCTCTCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTTACTCTCTTCTTC 720 

Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I 



Db 721 

TTTCCCGTGTGTGTCCTGCTGGCTTGGGTGGCAGATAAGCGACTGCTCTTCTACAAATAC 780 
Qy 781 

AT GCACAAAAAGT AC C GC ACAGACAAACAC CGAGGAATTAT C AT AGAGACAGAGGGTGAC 840 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I 

Db 781 

ATGC ACAAAAAAT AC C GCACAGATAAACAC CGAGGAAT TAT CAT T GAGACAGAGGGT GAC 840 
Qy 841 

CAC C CTAAGGGC ATT GAGAT GGAT GGGAAAAT GAT GAATT CCCATTTT CT AGATGGGAAC 900 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
I I I I I I I I I I I I I I I 

Db 841 

CAC C CTAAGGGCATT GAGATGGAT GGGAAAAT GAT GAATT CTCACTTT CT AGATGGGAAC 900 
Qy 901 

CT GGTGC CCCT GGAAGGGAAGGAAGT GGAT GAGT CCC GCAGAGAGAT GAT CCGGATT CT C 960 
I II I I I I I I I I I I I I I I I I I II I I I I I I I I I 

I I I I I I I I I I I I I I 
Db 901 

T TT ACAC CTT TGGAAGGAAAGGAGGT AGAT GAAT CT C GCAGGGAAAT GAT CCGGATT CTA 960 
Qy 961 

AAGGAT CT GAAGCAAAAACAC C CAGAGAAGGACTT AGAT CAGCT GGT GGAGAT GGC CAAT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 961 

AAGGAT CTGAAACAAAAACACCCAGAAAAGGAC CT AGAT CAGCT GGT GGAGAT GGCCAAT 1020 
Qy 1021 

TACT AT GCT CTT T CC CACCAACAGAAGAGC C GC GCCTT CT ACC GTAT CCAAGCCACT C GT 1080 

I I I I I I I I I I II I I I I I I II I I I I I I II II I II I I I I II I I II I I I I I I I II 

I I 

Db 1021 

TACTATGCTCTTTCCCATCAACAGAAGAGCCGTGCTTTCTACCGCATCCAAGCCACCCGG 1080 
Qy 1081 

AT GAT GACT GGT GCAGGCAAT AT C CT GAAGAAACAT GCAGC AGAACAAGC CAAGAAGGC C 1140 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I II I I I I I I I I I 

I I 

Db 1081 

AT GAT GACT GGT GCGGGCAAT AT ACTTAAGAAGCAT GCAGCCGAGCAAGCCAAGAAGACC 1140 
Qy 1141 

T CC AGCAT GAGC GAGGTGC ACACCGAT GAGCCT GAGGACT T TAT TT C CAAGGT CTT CT TT 1200 
I I II I I I I I I II I I I I I II I I I I I II II I I I II II I I I I I II 

I I I I I I I I I I II 
Db 1141 

T C CAGC AT GAGC GAGGTGC AT ACC GAT GAGCCGGAGGACTTT GC CT CTAAGGT CTT CTT T 1200 
Qy 1201 

GAC C CAT GTT CTT ACCAGT GCCT GGAGAACT GT GGGGCT GT ACT CCT GACAGT GGT GAGG 1260 
I I I I I I I I I I I I I I I I I I I I II I I II I I I I I II I I I I I I I I I I I I I I 

I I I I I I I II 

Db 1201 

GACCC AT GTT CTT AT CAGT GCCTGGAGAACT GT GGAGCT GT CCT C CT GAC CGT GGT GAGG 1260 



Qy 1261 



AAAGGGGGAGACAT GTCAAAGAC CAT GTAT GT GGACT ACAAAACAGAGGAT GGTT CT GCC 1320 

I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II II 

I I I 

Db 1261 

AAAGGGGGAGAT AT ATC CAAGACCAT GT ACGT GGACT ACAAAACAGAGGAC GGCT C CGCC 1320 
Qy 1321 

AAT GCAGGGGCT GACT AT GAGT T CAC AGAGGGCACGGT GGT T CT GAAGC CAGGAGAGACC 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I II 
Db 1321 

AAT GC AGGGGCAGACT AT GAGTT C AC AGAGGGC ACT GT GGT T CT GAAGCCAGGAGAGACC 1380 
Qy 1381 

CAGAAGGAGT T CT CCGT GGGC ATAATT GAT GAC GACATTTTT GAGGAGGAT GAACACTTC 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1381 

CAGAAGGAGTTCTCTGTGGGCATCATTGATGATGACATTTTTGAGGAGGAT GAACACTT C 1440 
Qy 1441 

TTT GTAAGGTT GAGCAAT GT CCGCATAGAGGAGGAGCAGC CAGAGGAGGGGAT GC CT CC A 1500 

I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

III 

Db 1441 

TTT GT GAGGCT GAGCAAT GT CC GT GT AGAAGAGGAGCAGCT GGC GGAGGGGAT GCT CCCA 1500 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560' 

MINI I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I 1 I I I I I I I I I I I I I 
Db 1501 

GCAATACTCAATAGTCTTCCTTTGCCTCGGGCTGTCCTGGCCTCCCCTTGTGTGGCCACA 1560 
Qy 1561 

GTTACCATCTT GGAT GAT GACCATGCAGGCATCTT CACTTTT GAAT GT GATACTATTCAT 1620 

II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I 

I I I I I I 

Db 1561 

GTAAC CAT CT T GGAT GAT GAC CAT GCAGGAATTTT C ACTTT T GAAT GT GAT ACCATT C AT 1620 
Qy 1621 

GT CAGT GAGAGT ATT GGT GT T AT GGAGGTCAAGGTTCT GCGGAC AT C AGGT GC C C GGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I II 

I I I I 

Db 1621 

GT CAGTGAAAGT ATT GGT GTT AT GGAAGT CAAGGT TTT GAGGACAT CAGGT GCC AGGGGC 1680 
Qy 1681 

ACAGT CAT C GT C CC CTTT AGGACAGT AGAAGGGACAGC CAAGGGT GGCGGT GAGGACTT T 174 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II 

I I I I I I I I I 

Db 1681 

ACAGT CAT CGT CC CTTTT AGGACAGT AGAAGGGACAGC CAAGGGT GGT GGC GAGGACTTT 1740 



Qy 1741 GAAGACACAT AT GGGGAGTT GGAATT CAAGAAT GAT GAAACT GT GAAA 1788 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I III II 
Db 1741 GAAGAT GCAT AT GGGGAGCT GGAGTT CAAGAAT GAT GAAAC AGT GTAA 1788 



RESULT 6 
AK035163 

LOCUS AK035163 4374 bp mRNA linear HTC 03- 

APR-2004 

DEFINITION Mus musculus 12 days embryo embryonic body between diaphragm 
region 

and neck cDNA, RIKEN full-length enriched library, 
clone: 9430095C22 

product: SODIUM/ CALCIUM EXCHANGER 2 PRECURSOR (NA(+) /CA(2+) - 

EXCHANGE 

PROTEIN 2) homolog [Rattus norvegicus] , full insert 

sequence . 

ACCESSION AK035163 

VERSION AK035163.1 GI:26084435 

KEYWORDS HTC; CAP trapper. 
SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; 

Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; 
Murinae; Mus. 
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6 (bases 1 to 4374) 

Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., 

Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume, W. , 
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Koya,S. f Kurihara,C, Matsuyama, T. , Miyazaki,A., Murata,M., 
Nakamura,M., Nishi,K., Nomura, K. , Numazaki,R., Ohno,M., 

Okazaki,Y., Saito,R., Saitoh, H., Sakai,C, Sakai,K., 

Sano,H., Sasaki, D., Shibata,K., Shinagawa, A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A. , Takahashi, F. , Takaku- 

Takeda,Y., Tanaka,T., Tomaru,A. , Toya,T., Yasunishi,A. , 
Muramatsu,M. and Hayashizaki, Y. 
Direct Submission 

Submitted ( 16- JUL-2001) Yoshihide Hayashizaki, The Institute 

Physical and Chemical Research (RIKEN) , Laboratory for 

Exploration Research Group, RIKEN Genomic Sciences Center 

RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, 

Kanagawa 230-0045, Japan (E-mail : genome-res @gsc . riken. jp, 
URL :http: //genome. gsc.riken. jp/, Tel : 81-45-503-9222, 
Fax:81-45-503-9216) 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in 

Genomic Sciences Center and Genome Science Laboratory in 



Hirozane,T. , 
Kasukawa,T. , 

Ohsato, N. , 
Sakazume,N. , 

Akahira, S . , 

TITLE 
JOURNAL 

of 

Genome 
(GSC), 
Yokohama, 

COMMENT 
Riken 
RIKEN. 

Division of Experimental Animal Research in Riken 
contributed to 

prepare mouse tissues. 

Please visit our web site for further details. 
URL:http: / /genome. gsc. riken. jp/ 
URL : http : // f antom. gsc . riken . jp/ . 
FEATURES Location/Qualifiers 
source 1. .4374 

/organism= f, Mus mus cuius" 

/mol_type= M mRNA" 

/strain="C57BL/6J" 



region and 

library" 

misc_feature 

ID, 

ORIGIN 



/ db__xr e f = " FANTOM_DB : 9430095C22" 
/db_xref="taxon: 10090" 
/clone="9430095C22" 

/tissue_type="embryonic body between diaphragm 
neck" 

/clone_lib="RIKEN full-length enriched mouse cDNA 

/dev_stage="12 days embryo" 
281. .3046 

/note="SODIUM/CALCIUM EXCHANGER 2 PRECURSOR 
(NA(+) /CA(2 + ) -EXCHANGE PROTEIN 2) homolog [Rattus 
norvegicus] ( SWISSPROT | P48768 , evidence: FASTY, 97% 

100%length, match=2763) 
putative" 



Query Match 45.5%; Score 1258.2; DB 3; Length 4374; 

Best Local Similarity 68.7%; Pred. No. 0; 

Matches 1829; Conservative 0; Mismatches 783; Indels 51; 
Gaps 5; 

Qy 136 

GAGT C CT GT T CAGGGT CAT C GGACT GCAAGGAGG GTGT CAT CCT GCCAAT CTGGT AC C CG 195 

II III I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 401 

GAAGGCTGCCAAGGTTCCTACCGCTGCCAACCAGGTGTGCTGCTGCCTGTGTGGGAACCC 460 



Qy 196 

GAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTATTTTGTGGCCCTGATA 255 

III I I I I II II II I I I I I I I I I I III I I II II I I I I I I I I I II 

I 

Db 461 

GAGGACCCATCGCTGGGCGACAAGGTTGCACGGGCCGTGGTGTACTTTGTGGCCATGGTC 520 



Qy 256 

T ACAT GTT CCTT GGGGT GTC CAT C ATT GCT GAC C GCTT CAT GGCAT CT ATT GAAGT CAT C 315 

I I I I I I I I I I I II I I I I I I I I I I I I I II II II I I I I I I I I I I I I I 

I I I I I I 

Db 521 

T AC AT GTT CCT GGGT GT GTCT AT CATT GCCGAT CGATTT AT GGCAT C CAT T GAGGT CAT C 580 
Qy 316 

ACCT CT CAAGAGAGGGAGGT GACAAT TAAGAAACCCAATGGAGAAACC AGCACAAC CACT 375 



Db 581 

ACAT C CAAGGAGAAAGAGAT CACC ATC ACCAAGGCAAAT GGGGAGACCAGC GT GGGC AC G 640 



Qy 376 

ATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCCTCTGCT 435 

I II I I I I I I I II II II I I I I I I I I II II I I I I I I I I I I I I I I I I 

I I 

Db 641 

GTGCGCATCTGGAACGAGACGGTGTCCAACCTTACACTCATGGCCCTGGGCTCCTCAGCG 700 



Qy 436 

CCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGATCTG 495 



II 1 1 1 1 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 

1 1 

Db 701 

CCTGAGATTCTGTTGACTGTCATCGAGGTCTGTGGCCACAACTTCCAGGCCGGTGAGCTA 760 
Qy 496 

GGAC CTT CT ACC ATT GT AGGGAGT GC AGC CTT CAACATGTT CAT CAT CAT TGGCAT CT GT 555 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 761 

GGCCCAGGCACCATCGTGGGCAGTGCCGCCTTCT^ACATGTTTGTGGTCATTGCTGTTTGT 820 
Qy 556 

GT CT AC GTGATC C CAGACGGAGAGACT CGCAAGAT CAAGC AT CT AC GAGT CTT CT T CAT C 615 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

I I 

Db 821 

GTGTATGTCATCCCGGCTGGCGAGAGCCGTAAGATCAAGCACCTGAGGGTCTTCTTTGTC 880 
Qy 616 

ACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCC 675 

II II I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I 

I I I I I I 

Db 881 

AC AGCCT C CT GGAGCATCT TT GCCT AT GT CT GGCTTT AT CT CAT TCT AGCAGT TT T CT CC 940 
Qy 676 

CCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTC 735 
I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I 

Db 941 

CCAGGTGTAGTCCAGGTGTGGGAGGCACTGCTCACACTGATCTTCTTCCCGGTGTGTGTG 1000 
Qy 736 

CTT CT GGCCT GGGTGGCAGATAAAC GACT GCT CTT CT ACAAAT ACAT GCACAAAAAGT AC 795 

I I I I II I I I I I I II M I I I I I I I I I I I I I I I I I III II I II I 

I I I 

Db 1001 

GT GTTT GCCT GGAT GGCGGACAAGC GACT GCT CTT CTACAAGT ACGT GT ACAAGC GCTAC 1060 
Qy 796 

CGC ACAGACAAAC AC C GAGGAAT TAT C AT AGAGACAGAGGGTGACCACC CTAAGGGCATT 855 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 1061 

CGCACCGAC CCT CGCAGT GGAATCATCAT CGGGGCAGAGGGAGACCC AC C CAAGAGCAT C 1120 

Qy 856 GAGAT GGAT GGGAAAAT GAT GAATT C C CAT TTT CT AGAT GGGAA 

CCTG 903 

I I I II I I I I I I I II I I II I II 

Db 1121 

GAGCTGGACGGCACATTCGTGGGCACTGAGGTCCCTGGCGAGCTGGGCGCATTGGGCACA 1180 
Qy 904 

GT GC CC CT GGAAGGGAAGGAAGTGGAT GAGT C C C GCAGAGAGAT GAT CCGGATT CT CAAG 963 

I II I I I I I I I I I I I I III I I I I I I I I I I I I I I 

I I I 

Db 1181 

GGT C CT GCT GAGGCGC GT GAACT AGAT GC CAGCC GGC GT GAGGT CAT CCAGAT CCT TAAG 1240 



Qy 964 

GAT CT GAAGCAAAAAC ACCC AGAGAAGGACTT AGAT CAGCT GGT GGAGAT GGC CAATT AC 1023 

II I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

I I I 

Db 1241 

GACTT GAAGCAGAAGCAC CC GGATAAGGAC CT GGAGCAGCT GAT GGGC AT C GC CAAGT AC 1300 



Qy 1024 

TATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGTATG 1083 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I 

Db 1301 

TATGCACTGCTGCACCAGCAGAAGAGCCGCGCCTTCTACCGCATCCAGGCCACGCGGCTG 1360 



Qy 1084 

AT GACT GGT GC AGGCAAT AT C CT GAAGAAACAT GCAGCAGAACAAGC CAAGAAGGCCT CC 1143 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II 

I 

Db 1361 ATGACAGGTGCGGGCAATGTGCTGCGCAGACATGCTGCGGATGCTGCCCGCAGG 

CCG 1417 



Qy 1144 

AGCAT GAGCGAGGT GC ACAC C GAT GAGC CT GAGGACTT T ATTT C CAAGGT CTT CTTT GAC 1203 
I I I I I I I II I I I I I I I I I I 

I I I I I II I I I 
Db 1418 

GGAGCCACCGAT GGT GC CCCCGAT GAT GAGGACGAT GGTGC CAGT CGCAT CTT CTTT GAG 1477 



Qy 1204 

CC AT GTT CT T AC CAGTGC CT GGAGAACT GT GGGGCT GTACT CCT GACAGT GGT GAGGAAA 1263 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1478 

CCCAGCCTCTATCACTGCCTGGAAAACTGCGGGTCAGTGCTGCTGTCCGTGGCTTGCCAG 1537 



Qy 1264 

GGGGGAGACAT GTCAAAGACCAT GTATGTGGACTACAAAACAGAGGAT GGTTCTGCCAAT 1323 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1538 

GGCGGTGAGGGCAACAGCACCTTCTACGTGGACTACCGTACCGAGGACGGTTCTGCAAAG 1597 



Qy 1324 

GCAGGGGCTGACTAT GAGTTCACAGAGGGCACGGTGGTTCT GAAGCCAGGAGAGACCCAG 1383 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1598 

GCAGGCT CCGAT TAT GAGT ACAGC GAGGGC ACGCT GGT GT T CAAGCC C GGGGAGACGCAG 1657 
Qy 1384 

AAGGAGTT CT C C GT GGG C ATAAT T GAT GAC GACAT TTT T GAGGAGGAT GAACACTT CTT T 1443 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I 

Db 1658 

AAGGACCT GC GCAT C GGGAT CAT C GAC GAC GAC AT CT T CGAGGAGGAT GAGCACTT CTT C 1717 
Qy 1444 

GT AAGGTT GAGCAAT GT C C GCAT AGAGGAGGAGCAGC C AGAGGAGGGGAT GCCT C CAGCA 1503 
I I I I I I I II III II III III I II 

Db 1718 



GTGAGGCTGCTGAACCTGCGTGTGGGCGATGCTCAGGGCATGTTCGAG 



1765 



Qy 1504 

ATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTT 1563 

I I I I I III II III III I I I I I I I 

I I 

Db 1766 — 

CCCGACGGCGGTGGGCGGCCCAAGGGGCGGCTGGTGGCGCCGCTGCTGGCCACTGTC 1822 
Qy 1564 

ACCAT CTT GGAT GAT GAC CAT GC AGGCAT CT T CACTTTT GAAT GT GAT ACT ATT CAT GT C 1623 

I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I 

I I I I I 

Db 1823 

ACCATCCTGGACGACGACCACGCGGGCATCTTCTCCTTCCAGGACCGCCTGCTGCATGTG 1882 



Qy 1624 

AGT GAGAGT AT T GGT GTT AT GGAGGT CAAGGTT CT GC GGAC ATCAGGT GCC CGGGGTAC A 1683 

I I I II I I I I I I I I I I II II I I I I I I I I I I I I I I I 

I I 

Db 1883 

AGCGAGTGCATGGGCACTGTGGATGTGCGCGTGGTTCGCAGCTCGGGCGCCCGTGGCACT 1942 



Qy 1684 

GT CAT C GTC C CCT TT AGGACAGT AGAAGGGACAGC CAAGGGT GGCGGT GAGGACTTTGAA 1743 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I 

Db 1943 

GT ACGCCTC CC CT ACC GCACAGT GGACGGCACGGC CCGT GGC GGT GGT GTACAT T ACGAG 2002 



Qy 1744 

GACACATATGGGGAGTT GGAATTCAAGAATGAT GAAACTGTGAAAACCATAAGGGTTAAA 1803 

II I ! Ill II Ill I I I I I I I I I I I I I I I I I III 

I I 

Db 2003 

GAT GCTT GT GGAGAGCT GGAGTTC GG CGAT GAT GAGAC CAT GAAAACT CTT CAGGT CAAG 2062 



Qy 1804 

AT AGT AGAT GAGGAGGAAT ACGAAAGGCAAGAGAATT T CTT CAT T GC C CTT GGT GAAC C G 1863 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2063 

AT AGT GGATGAT GAAGAGTATGAGAAGAAGGACAACTTCTTCATT GAGCTGGGCCAGCCC 2122 



Qy 1864 AAATGGATGGAACGTGGAATATCAGATGTGAC 

AGACAGGAAG 1905 

I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 2123 

CAGT GGCTTAAGCGAGGCAT CTC AGCT CTGCT ACT CAACCAAGGGAAT GGAGACAAGAAG 2182 



Qy 1906 

CT GACT ATGGAAGAAGAGGAGGCCAAGAGGAT AGCAGAGAT GGGAAAGC CAGT ATT GGGT 1965 

I III II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

I I 

Db 2183 

AT AACT GCAGAGC AGGAGGAGGC C CAGAGGAT AGCAGAGAT GGGCAAGC CAGTT CTT GGG 2242 



Qy 1966 

GAAC AC C CCAAACT AGAAGT CAT C ATT GAAGAGT CCT AT GAGTT CAAGACT ACGGT GGAC 2025 



II II 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 

Db 2243 

GAGAACAAT CGCCTCGAGGTCAT CAT CT AGGAGT CTTATGACT TTAAGAACAC GGT GGAT 2302 
Qy 2026 

AAACTGATCAAGAAGACAAACCT GGC CT T GGTT GT GGGGAC CCATT C CT GGAGGGACCAG 2085 

I I I I I II I I I I I II I I I II I I I I I I I I I I I I I I I II II I I I I I I I I 

I I I 

Db 2303 

AAACT CATAAAGAAAACGAACCT GGC CTT GGT GATT GGGACTCACT CAT GGAGGGAGCAG 2362 

Qy 2086 TT CAT GGAGGC CAT CAC C GT CAGT GC AGCA 

GGGGAT GAGGAT GAGGAT GAAT CC GGG 2142 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2363 

TTT ATAGAGGCAGTT AC C GTAAGCGCAGGGGACGAGGAGGAGGATGAGGATGGACCT C GT 2422 
Qy 2143 

GAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAG 2202 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

II I I I I I I 

Db 2423 

GAGGAGCGGCT GC CGT CCT GCT T T GACT AC GT GAT GCACTT CCT GAC GGT AT T CT GGAAA 2482 
Qy 2203 

GTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTC 2262 

II II II I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2483 

GTTCTATTCGCCTGCGTTCCACCCACGGAGTACTGCAATGGCTGGGCCTGCTTTGGTGTC 2542 
Qy 2263 

TCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGC 2322 

I I I I I I I I I I I I I I II I I I I I III I I I I II I I I I I I I I I I I Mill 

I I 

Db 2543 

TGCATCCTGGTCATTGGTGTGCTCACTGCCCTCATCGGAGACCTGGCCTCACACTTTGGG 2602 
Qy 2323 

T GC AC CATT GGT CTCAAAGATT CAGT C ACAGCT GTTGT TT T CGT GGC AT T TGGCAC CT CT 2382 
I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I II II I 

I I I I I I I I 

Db 2603 

TGCACCGTGGGCCTCAAGGACTCAGTCAACGCCGTGGTCTTCGTAGCCCTGGGCACCTCC 2 662 
Qy 2383 

GT C C CAGAT ACGTTT GCC AGCAAAGCT GCTGCCCTC CAGGAT GT AT AT GC AGACGC CT C C 2442 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I II I 

III 

Db 2663 

ATCCCTGACACGTTCGCCAGCAAGGTGGCCGCGCTGCAGGACCAGTGTGCCGACGCGTCC 2722 
Qy 2443 

ATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGG 2502 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I 

I I I I I I I I 

Db 2723 

ATCGGTAACGTGACCGGCTCCAATGCGGTGAACGTGTTCCTGGGCCTGGGTGTGGCCTGG 2782 



Qy 2503 

TCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACA 2562 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I II I 

Db 2783 

TCGGTGGCAGCGGTATACTGGGCGGTGCAGGGTCGCCCCTTCGAGGTGCGTGCAGGCACG 2842 
Qy 2563 

CTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTG 2 622 

I I I I I II I I I I I I I I I II I I I I I I I I I I II II II I I I I I I I I I I I 

I I I 

Db 2843 

CTGGCCTTCTCGGTCACGCTGTTCACCGTCTTCGCCTTCGTGTGCATCGCAGTGCTGTTG 2902 
Qy 2623 

TACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCC 2682 

II I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2903 

TACCGGCGCCGGCCGCAGATTGGCGGCGAGCTGGGCGGCCCGCGGGGACCCAAGCTGGCC 2962 
Qy 2683 

ACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCC 2742 

I I I I I I I I I II I I I I III I I I I I I I I II I I II I III II 

I I I I I 

Db 2963 

ACCACCGCTCTCTTCCTGGGCCTCTGGTTCCTCTACATTCTCTTCTCCAGCCTGGAGGCT 3022 

Qy 2743 T ATT GCT ACAT CAAGGGGTT CT A 2765 

II III I I I I I III I I I I I 

Db 3023 TACTGCCACATCCGGGGCTTCTA 3045 



linear GSS 15 



RESULT 7 
AY398961 

LOCUS AY398961 2922 bp DNA 

DEC-2003 

DEFINITION Homo sapiens SLC8A1 gene, VIRTUAL TRANSCRIPT, partial 
sequence, 

genomic survey sequence. 
ACCESSION AY398961 

VERSION AY398961.1 GI: 39754950 

KEYWORDS GSS. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
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REFERENCE 2 (bases 1 to 2922) 
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FEATURES Location/Qualifiers 
source 1. .2922 

/organism="Homo sapiens" 
/ mo l_type=" genomic DNA" 
/db_xref="taxon:9606" 
gene <1. .>2922 

/gene="SLC8Al" 
/locus_tag="HCM0065" 

ORIGIN 

Query Match 43.7%; Score 1208.8; DB 9; Length 2922; 

Best Local Similarity 66.9%; Pred. No. 0; 

Matches 1887; Conservative 0; Mismatches 762; Indels 171; 
Gaps 5; 

Qy 109 

GAC GT GC CAAGCACAGGGCAGAACAAT GAGT CCT GTT CAGGGT CAT C GGACT GCAAGGAG 168 

II II II III I I I I I II I I II I II I I I I I I I I 

I 

Db 112 

GAAAT GGAAGGAGAAGGAAAT GAAACT GGTGAAT GT ACT GGAT CAT ATT ACT GTAAGAAA 171 
Qy 169 

GGTGTCATCCTGCCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGG 228 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I 

I I 

Db 172 

GGGGTGATTTTGCCCATTTGGGAACCCCAAGACCCTTCTTTTGGGGACAAAATTGCTAGA 231 
Qy 229 

GTCATTGTCTATTTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGAC 288 

I I III I I I I I I I I I I I I II I I I I I I II I I I I I I II II I I I I I 

I I I I I 

Db 232 

GCTACTGTGTATTTTGTGGCCATGGTCTACATGTTTCTTGGAGTCTCTATCATAGCTGAT 291 
Qy 289 

C GCTT CAT GGCAT CT ATT GAAGT CAT CAC CT CT CAAGAGAGGGAGGT GACAATTAAGAAA 348 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I 

I I I I I I 

Db 292 

CGGTT C AT GT CCT CT ATAGAAGT CAT CACAT CT CAAGAAAAAGAAATAAC CATAAAGAAA 351 
Qy 349 

C C CAAT GGAGAAACC AGCACAAC CACT ATT C GGGT CT GGAAT GAAACT GT CT CCAACCT G 408 

I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I II II 

Db 352 



CC CAAT GGAGAGACC ACCAAGACAACT GT GAGGAT CT GGAATGAAACAGTTT CTAAC CT G 411 



Qy 409 

ACCCTTATGGCCCTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGT 468 

III I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I 

I II I I I 

Db 412 

ACCTTGATGGCCCTGGGATCTTCTGCTCCTGAGATTCTCCTTTCAGTAATTGAAGTGTGT 471 
Qy 469 

GGTCATGGGTTCATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTC 528 

II III I I I I III II II II II III I I I I I II II I I I I I II 

I I I 

Db 472 

GGCCATAACTTCACTGCAGGAGACCTCGGTCCTAGCACCATCGTGGGAAGTGCTGCATTC 531 
Qy 529 

AACAT GTT CATCAT CATT GGCAT CT GT GT CT ACGT GAT CCCAGACGGAGAGACT C GCAAG 588 

II I I I I I I I I I I I I I I I I I I I I I I II III I I I I I I I I I 1 I I I I I 

I I I 

Db 532 

AAT ATGTTCAT CATT ATT GCACT CT GT GT TT AT GT GGT GCCTGACGGAGAGACAAGGAAG 591 
Qy 589 

AT CAAGCAT CT AC GAGTCTT CTT CAT CAC C GCT GCTT GGAGTAT CT TT GC CT AC AT CT GG 648 

II I I I II I I II I I I I I I I I I II II II Mill I I I I I I I I I I I I I 

I I I I 

Db 592 

ATTAAGCATTTGCGTGTCTTCTTTGTGACAGCAGCCTGGAGCATCTTTGCCTACACCTGG 651 
Qy 649 

CTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTC 7 08 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 652 

CTTTACATTATTTTGTCTGTCATATCTCCTGGTGTTGTGGAGGTCTGGGAAGGTTTGCTT 711 
Qy 709 

ACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTC 7 68 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 712 

ACTTTCTTCTTCTTTCCCATCTGTGTTGTGTTCGCTTGGGTAGCGGATAGGAGACTTCTG 771 
Qy 769 

TT CT ACAAAT AC ATGCACAAAAAGT ACC GCACAGACAAACACC GAGGAATTAT CAT AGAG 828 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

I I 

Db 772 

T T TT ACAAGT AT GT CT ACAAGAGGT AT C GAGCT GGCAAGCAGAGGGGGAT GATT ATTGAA 831 

Qy 829 ACAGAGGGT GAC CAC C C 

T AAGGGCATT GAGAT GGAT GGGAAAAT GAT GAAT 879 

I I I I I I I II I I I I I I I II I I I I I I I I I I I I 

I I I 

Db 832 

CAT GAAGGAGAC AGGC CAT CTT CTAAGACT GAAAT T GAAAT GGACGGGAAAGT GGT CAAT 891 



Qy 880 

TCCCATTTTCTAGATGGGAACCTGGTGCCCCTGGAAGGGAAG- 



921 



I I 1 1 1 1 1 I 1 1 I I 1 1 1 1 III 

Db 892 

T CTCAT GTT GAAAATTT CTT AGAT GGT GCT CT GGTT CT GGAGGT GGAT GAGAGGGACCAA 951 
Qy 922 

GAAGT GGAT GAGTCCCGCAGAGAGATGAT C CGGATTCT CAAGGAT CT GAAGCAAAAAC AC 981 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 952 

GATGATGAAGAAGCTAGGCGAGAAATGGCT AGGATT CT GAAGGAACTTAAGCAGAAGCAT 1011 
Qy 982 

CCAGAGAAGGACTT AGAT C AGCT GGT GGAGAT GGC CAATT ACT AT GCTCT T TC CCAC CAA 1041 

I I I I I I I I I I I I I I I I III I I I I I I I I I I II II 

I I 

Db 1012 

CC AGATAAAGAAAT AGAGCAAT TAAT AGAAT T AGCTAACT ACCAAGT CCTAAGT C AGCAG 1071 
Qy 1042 

CAGAAGAGCC GCGC CTT CT AC C GT ATCCAAGCCACT CGT AT GAT GACT GGT GCAGGCAAT 1101 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1072 

CAAAAAAGT AGAGCATTT T AT C GCATT CAAGCT ACT CGCCT CAT GACT GGAGCT GGCAAC 1131 
Qy 1102 

AT CCT GAAGAAACAT GCAGC AGAACAAGC CAAGAAGGC CT C CAGCAT GAGC GAGGT GC AC 1161 

II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

I I 

Db 1132 

ATTTTAAAGAGGCAT GCAGCT GACCAAGCAAGGAAGGCT GT CAGCAT GCAC GAGGT CAAC 1191 
Qy 1162 ACCGAT GAGCCTG 

AGGACTTTATTTCCAAGGTCTTCTTTGACCCATGTTCTTACCAG 1218 

I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I III 

I I I 

Db 1192 

ACT GAAGT GACT GAAAAT GAC CCT GTT AGTAAGAT CTT CTTTGAACAAGGGACAT AT CAG 1251 
Qy 1219 

T GCCT GGAGAACT GT GGGGCT GT ACT C CT GACAGT GGT GAGGAAAGGGGGAGAC AT GT CA 1278 

I I II I I I I II I II I I I II I I I I I I I I I I I I I I I I I I II 

I 

Db 1252 

T GTCT GGAGAACT GT GGT ACT GT GGCCCTTAC CAT TAT CC GC AGAGGT GGT GATT T GACT 1311 
Qy 1279 

AAGAC CAT GT AT GT GGACT ACAAAACAGAGGAT GGTT CT GCCAAT GCAGGGGCT GACT AT 1338 

MM I I I I I I I II I I I I I II I I I I I I I I I I I I I II I I I I I I I I 

I I I 

Db 1312 

AACACTGT GT TT GTT GACT T CAGAAC AGAGGAT GGCACAGCAAAT GCT GGGTCT GATT AT 1371 
Qy 1339 

GAGT T CACAGAGGGCAC GGT GGTT CT GAAGCCAGGAGAGAC C C AGAAGGAGTT CT C CGT G 1398 

I I I I I I II I I I I I M II I II I I I I I I I I I II I I I I I I I II 

I I I 

Db 1372 

GAATTT ACT GAAGGAACT GT GGT GTTTAAGCCT GGT GATAC CCAGAAGGAAAT C AGAGT G 1431 



Qy 1399 

GGCAT AATT GAT GACGACATTTTT GAGGAGGAT GAACACTT CTT TGTAAGGT T GAGCAAT 1458 

II II II I I I I I II II I I I I I I I I I I I I I I I I III I I I I I 

I I I I I I 

Db 1432 

GGT AT CATAGAT GAT GAT AT CT TT GAGGAGGAT GAAAATT T CCT TGT GCATCT C AGCAAT 1491 
Qy 1459 

GT CC GCATAGAGGAGGAGCAGC CAGAGGAGGGGAT GCCT C C AGCAAT ATT CAACAGT CTT 1518 

III II II I I I I I I I I I I I III I I I I 

I 

Db 1492 GT CAAAGT AT CTT CT GAAGCTT C AGAAGAT GGCAT ACT GGAAGC 

CAATCAT 1542 

Qy 1519 

CCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGATGAT 1578 

I • II III III I Mill I I I I I I I I I I I I I 

II I I I I 

Db 1543 

GTTTCTACACTTGCTTGCCTCGGATCTCCCTCCACTGCCACTGTAACTATTTTTGATGAT 1602 
Qy 1579 

GACC AT GC AGGCATCTTCACT TT T GAAT GT GAT ACT ATT CAT GT CAGT GAGAGT ATT GGT 1638 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1603 

GACCACGCAGGC ATTTTT ACT TTT GAGGAACCT GT GACT CAT GT GAGT GAGAGCATTGGC 1662 
Qy 1639 

GTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGTACAGTCATCGTCCCCTTT 1698 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

I 

Db 1663 

AT C AT GGAGGTGAAAGT ATT GAGAACAT CT GGAGCT CGAGGAAATGTT AT CGT TC CAT AT 1722 
Qy 1699 

AGGACAGT AGAAGGGAC AGC C AAGGGT GGC GGT GAGGACTTTGAAGACACAT AT GGGGAG 1758 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1723 

AAAAC CAT CGAAGGGACT GCCAGAGGT GGAGGGGAGGATT T T GAGGACACTT GTGGAGAG 1782 
Qy 1759 

TT GGAAT T CAAGAAT GAT GAAACT GT GAAAAC CAT AAGGGT T AAAATAGTAGAT GAGGAG 1818 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 1783 

CT C GAATT C CAGAAT GAT GAAATT GT CAAAACAAT AT CAGT CAAGGTAATTGATGATGAG 1842 
Qy 1819 

GAAT ACGAAAGGCAAGAGAATTT CT T CAT T GCCCTT GGT GAAC C GAAAT GGAT GGAA 1875 

II I I II I I III I II I I I II I I I I I I I II I I I I I 

Db 1843 

GAGT AT GAGAAAAACAAGACCTT CTT CCTT GAGAT T GGAGAGCC C CGCCT GGT GGAGATG 1902 
Qy 1876 

1875 



Db 1903 

AGTGAGAAGAAAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 1962 



Qy 1876 



1875 



Db 1963 

NNNNNNN GC CAAC CT GT CTT CAGGAAGGTT C AT GCTAGAGAAC AT CC GATT CT CT CT ACT 2022 
Qy 1876 

CGTGGAAT AT C AGATGT GAC AGACAGGAAGCT GACT AT GGAAGAAGAGGAG 1926 

III I I I I I I Mill I I I I I I I 

I I I 

Db 2023 

GTAATCACCATTGCAGACGAATATGATGACAAGCAGCCACTGACCAGCAAAGAGGAAGAG 2082 
Qy 1927 

GCCAAGAGGAT AGC AGAGAT GGGAAAGCCAGT ATT GGGTGAAC ACCC CAAACT AGAAGT C 1986 

I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I 

I I I I I 

Db 2083 

GAGAGGCGC ATT GC AGAAAT GGGGC GCCCCAT CCT GGGAGAGC ACAC CAAGTT GGAAGTG 2142 
Qy 1987 

AT CATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACT GATCAAGAAGACAAAC 2046 
I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I II 

I II I I I I I I I I I 
Db 2143 

ATCATTGAAGAATCCTAT GAATTCAAGAGTACTGT GGACAAACTCATTAAGAAGACAAAC 2202 
Qy 2047 

CT GGC CTT GGTT GT GGGGACC CATT CCT GGAGGGACC AGTT CAT GGAGGC CAT CACC GT C 2106 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2203 

CT GGC CCT T GT GGTT GGGACTAACAGCT GGAGAGAAC AGTT CATT GAAGCT AT C ACT GT C 22 62 
Qy 2107 

AGTGCAGC AGGGGAT GAGGAT GAGGAT GAAT C C GGGGAGGAGAGGCT GCCCTC CT GCT TT 2166 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

I I 

Db 2263 

AGT GCTGGGGAAGATGAT GAC GAC GAT GAAT GT GGGGAAGAGAAGCT GCCCTCCT GTTT C 2322 
Qy 2167 

GACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTGCCCCCC 2226 

II Mill I I I I I I I I I II I I II I II I I I I I I I II II II II I II I II 

I I I I I 

Db 2323 

GATTACGTGATGCACTTTCTGACTGTGTTCTGGAAGGTCCTGTTTGCCTTCGTCCCCCCT 2382 
Qy 2227 

ACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTC 2286 



Db 2383 

ACTGAATACTGGAATGGCTGGGCGTGTTTCATTGTCTCCATCCTCATGATTGGCCTACTG 2442 
Qy 2287 

ACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGATTCA 2346 
II II I I I I I I I I I I I I I II II I I II I II I I I II I I II I II M 

I I I II I I I 



Db 2443 

ACAGCTTTCATTGGAGACCTGGCTTCCCACTTTGGCTGCACCATTGGCCTGAAAGATTCT 2502 



Qy 2347 

GTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAA 2406 
II II II II II I I II I III I I I I II II II I I I I I II 

I II I I I I I I I I I 
Db 2503 

GT GACT GC AGT CGT GTT C GT C GCACTT GGAAC AT CAGT GC CAGAC AC ATTT GCCAGCAAA 2562 
Qy 2407 

GCTGCT GC CCT CCAGGAT GT ATAT GCAGACGCCT C CATTGGCAAC GT GAC GGGCAGCAAC 2466 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

I I I I I I I I I I I I 

Db 2563 

GT GGC AGC C AC C CAGGAC CAGT AT GCAGAC GC CT C CAT AGGTAAC GT CAC GGGCAGCAAC 2622 
Qy 2467 

GCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTACTGGGCT 2526 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II MINIMI 

I I 

Db 2623 

GCGGTGAATGTCTTCCTGGGAATCGGTGTGGCCTGGTCCATCGCTGCCATCTACCACGCA 2682 
Qy 2527 

CTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACCCTCTTC 2586 

III I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

I I I I I I 

Db 2683 

GCCAATGGGGAACAGTTCAAAGTGTCCCCTGGCACACTAGCTTTCTCTGTCACTCTCTTC 2742 
Qy 2587 

ACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCACCTGGGA 2646 

II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

III 

Db 2743 

ACCATTTTTGCTTTCATCAATGTGGGGGTGCTGCTGTATCGGCGGAGGCCAGAAATCGGA 2802 
Qy 2647 

GGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTGAGCCTG 2706 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

I I I 

Db 2803 

GGTGAGCTGGGTGGGCCCCGGACTGCCAAGCTCCTCACATCCTGCCTCTTTGTGCTCCTA 2862 
Qy 2707 

T GGCT CCT CT ACAT ACT CT TT GC C ACACT AGAGGCCT ATT GCT AC AT CAAGGGGTTCTAA 2766 

I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I 

I I I I I I 

Db 2863 

TGGCTCTTGTACATTTTCTTCTCCTCCCTGGAGGCCTACTGCCACATAAAAGGCTTCTAA 2922 
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Qy 142 

T GT T CAGGGT CAT C GGACT GCAAGGAGGGT GT CAT CCT GC CAAT CT GGT ACCCGGAGAAC 201 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

I I 

Db 145 

T GT ACT GGCT CAT ATT ACT GTAAGAAAGGGGT GAT CT T GC CCATTT GGGAAC CCCAAGAC 204 
Qy 202 

CCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTATTTTGTGGCCCTGATATACATG 261 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I II I I 

Db 205 

CCATCTTTTGGGGACAAAATTGCTAGAGCAACTGTGTATTTTGTGGCCATGGTCTACATG 2 64 



Qy 262 

TTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCATCTATTGAAGTCATCACCTCT 321 
I I I I I I I I II II II I I I I I I I I I I II III I I I I I I II 

I I I I I I I I I I I I 

Db 265 

TTCCTTGGAGTTTCTATTATTGCAGACCGGTTTATGTCCTCTATAGAGGTCATCACCTCT 324 



Qy 322 

CAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAAACCAGCACAACCACTATTCGG 381 

I I I I I I I II I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I 

I 

Db 325 

C AAGAGAAAGAAAT AAC G AT AAAGAAAC C GAAT G GAG AGAC C AC CAAGAC GAC GGT GAGA 384 



Qy 382 

GTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCCTCTGCTCCTGAG 441 
I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I II 

I I I I I I I I I I I I 
Db 385 

AT CT GGAAC GAGACT GT GT CGAAC CTGACCT T GAT GGCCCT GGGAT CTT CT GCT CCT GAG 444 



Qy 442 

ATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGATCTGGGACCT 501 

I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 445 

ATTCTCCTGTCAGTCATTGAAGTGTGCGGCCATAACTTCACCGCAGGGGACCTGGGTCCC 504 



Qy 502 

T CTACCATT GT AGGGAGT GC AGC CT TCAAC AT GT T CAT CAT CATT GGC AT CTGT GT CT AC 561 

I I I I I II II I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

I I I 

Db 505 

AGCAC CAT C GT GGGAAGT GCT GC CT TTAACAT GTT CAT CATAAT CGCACT CT GT GT T T AC 564 



Qy 562 

GTGAT CCCAGACGGAGAGACTCGCAAGATCAAGCATCTACGAGT CTT CTTCAT CACCGCT 62 1 

III I I I I II I I I I I I I I I II I I I I I I I I I I II II II I I I I I I II 

I I 

Db 565 

GT GGTCCCT GAT GGAGAGACAAGGAAGATCAAGCATCT GCGT GT GTTCTTT GT GACAGCA 624 



Qy 622 

GCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGGT 681 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

I I I I I 

Db 625 

GCCTGGAGCATCTTTGCCTATACCTGGCTTTATATAATCTTGTCTGTCAGCTCTCCTGGA 684 
Qy . 682 

GTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTG 741 

' . ' I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 685 

GTTGTGGAGGTCTGGGAAGGCTTGCTTACTTTCTTCTTCTTTCCCATCTGCGTTGTGTTC 744 



Qy 742 

GCCT GGGTGGCAGATAAACGACT GCTCTTCT ACAAATACAT GCACAAAAAGTACCGCACA 801 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



I 

Db 745 

GCGTGGGTAGCAGACAGGCGGCTTCTCTTTTACAAGTATGTCTACAAGCGGTACAGGGCC 804 

Qy 802 GACAAACACC GAGGAATT AT C ATAGAGACAGAGGGT GACC ACC C 

TAAGGGC 852 

I I I I I I MINIMUM M M Ml M I 

Db 805 

GGCAAGCAGAGGGGGAT GATCATT GAACATGAAGGAGACAGACCAGCTTCCAAAACTGAA 864 
Qy 853 

ATT GAGATGGAT GGGAAAAT GAT GAATT CCC ATTTT CT AGAT GGGAACCT GGT GCCCCT G 912 

M I I M M M I M I M M I M M M I M II III 

I I I 

Db 865 

AT CGAAATGGAT GGGAAAGT G GT CAACT CTCAT GTT GACAATTT CTT AGAT GGGGCT CT G 924 



Qy 913 GAAGGG 

AAGGAAGTGGAT GAGTCCCGCAGAGAGAT GATCCGG 954 

I I I II I I I I I I I I I II II I I 

I I 

Db 925 

GTT TT GGAAGT T GAT GAGAGGGACCAAGATGATGAGGAAGCC AGGCGT GAGAT GGCAAGG 984 



Qy 955 

ATT CTCAAGGATCT GAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATG 1014 

II I I I I I I II I I I I I I I II I I I I I I I II I I I I I I I I III 

I 

Db 985 

ATTCTGAAGGAACTTAAGCAGAAGCATCCTGAGAAAGAAATTGAGCAATTAATAGAATTA 1044 
Qy 1015 

GCCAATT ACT AT GCT CT T T C CCACCAACAGAAGAGC CGC GCCT T CT AC CGT AT CCAAGC C 1074 

II II I I I I I I II I I I I I I I I I M I I I M M I I I I M 

Mill 

Db 1045 

GC CAACT AC CAGGT C CTAAGT CAACAGCAGAAAAGC CGAGCAT TT T AC AGGATT CAAGCT 1104 



Qy 1075 

ACT CGT AT GAT GACT GGT GCAGGCAAT AT CCT GAAGAAACAT GCAGCAGAACAAGCCAAG 1134 

I II I I II I II I I I I I I I I I I I I I I I II M I M I I M I II II M I I 

I 

Db 1105 

ACT C GC CT GATGACCGGAGCT GGCAACAT CTT GAAGAGGCACGCAGCT GAT CAAGCAAGG 1164 

Qy 1135 AAGGCCT CCAGCAT GAGC GAGGT GCACACCGAT GAGCCT G 

AGGACT TTATT T C CAAG 1191 



Db 1165 

AAGGCT GT C AGT AT GCAT GAAGT CAACAT GGAAAT GGCT GAAAACGAC C CAGT CAGTAAG 1224 



Qy 1192 

GTCTTCTTTGACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACA 1251 

I I II I I I I I I I I I I II I I I I M M I II I I I I II I I MM II I 

I I 

Db 1225 

AT CTTCTTT GAGCAAGGAACAT AC CAGT GTCT AGAGAACT GT GGT ACT GT GGC CCT C AC C 1284 



Qy 1252 



GT GGT GAG GAAAGGGGGAGACAT GT CAAAGACCAT GT AT GTGGACT ACAAAACAGAGGAT 1311 

I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I 

I I 

Db 1285 

ATT AT GC GC AGAGGGGGCGACT T GAGCACCACT GT GTTT GTT GACTT CAGGACAGAAGAC 1344 



Qy 1312 

GGTTCTGCCAATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCA 1371 

II I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I III 

I I I 

Db 1345 

GGC ACAGCCAAT GCTGGGT CTGATT AT GAAT T CAC GGAAGGGACT GT GATCTTCAAAC CA 1404 



Qy 1372 

GGAGAGACCCAGAAGGAGTT CT CCGT GGGCATAATTGATGACGACATTTTTGAGGAGGAT 1431 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I 

I I I 

Db 1405 

GGGGAGAC C C AGAAGGAAAT CAGAGTTGGCAT CAT T GAT GAT GAT ATCTTT GAAGAAGAT 1464 



Qy 1432 

GAACACTTCTTT GTAAGGTT GAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGG 1491 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

I I 

Db 1465 

GAAAACT T CCTT GT GC ATCTTAGCAAT GT CAGAGT CT CTT C AGAT GTT T CAGAAGAT GGC 1524 



Qy 1492 

ATGCCTCCAGCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGT 1551 

II III I I II II I I II I I I I I I I 

I 

Db 1525 AT 

ACTAGAATCCAATCACGCTTCTTCAATTGCTTGTCTTGGGTCACCCAGC 1575 



Qy 1552 

GT GGC C ACAGTT AC CAT CTT GGAT GATGACC AT GC AGGC AT CTT CACTT TT GAAT GT GAT 1611 
I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II III 

Db 1576 

ACT GCCAC C ATAACCAT TTTT GATGAT GACCAT GCAGGCAT CTTT ACATTT GAGGAAC CC 1635 



Qy 1612 

ACT ATT CAT GT CAGTGAGAGT AT TGGTGTTAT GGAGGT CAAGGTT CTGC GGACAT CAGGT 1671 

I III II II I I I I I I I I I I I I I I I I I I I I I I I I I II I II II 

I I 

Db 1636 

GTGACT CACGTGAGCGAGAGCATTGGCAT CAT GGAGGT GAAGGTTTTGAGAACCTCTGGA 1695 



Qy 1672 

GCCCGGGGTACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGT 1731 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

I I 

Db 1696 

GCT CGAGGAAAT GTT AT CAT TC CCT ACAAAACT ATT GAAGGCACAGCCCGAGGT GGAGGG 1755 



Qy 1732 

GAGGACTTT GAAGACACAT ATGGGGAGTT GGAAT T CAAGAAT GAT GAAACT GTGAAAACC 1791 

II I I I I I I I I I I I I I I III III I I I I I I I I I I I I I I I I I I II 

I I I I I 

Db 1756 

GAAGACT T TGAGGACAC CT GT GGAGAGCT CGAATT C CAGAAC GAT GAAAT AGT CAAAACA 1815 



Qy 1792 

ATAAGGGTTAAAAT AGT AGAT GAGGAGGAAT ACGAAAGGCAAGAGAATTT CT T C ATT GCC 1851 

III I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I 

Db 1816 

ATATCAGTCAAGGTAATCGATGACGAGGAGTATGAGAAAAACAAGACCTTCTTCATTGAG 1875 
Qy 1852 

CTT GGTGAAC CGAAAT GGAT GGAAC GT GGAAT AT CAGAT GT GAC AGACAGGAAG 1905 

II I I I I I I I I I II I II 

Db 1876 

AT TGGAGAGCCC CGT CT GGT GGAGATGAGT GAGAAGAAAGNNNNNNNNNNNNNNNNNNNN 1935 
Qy 1906 

1905 

Db 1936 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGC C AACCT AT CT T CAGGAAGGT C CAT 1995 
Qy 1906 

190 5 

Db 1996 

GCT AGAGAT CAT C C GATT CC CT CT ACT GT AAT CAC CAT CT CAGAGGAATAT GAT GACAAG 2055 
Qy 1906 

CTGACTATGGAAGAAGAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTA 1959 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 2056 

CAGCCACT GAC CAGCAAAGAAGAGGAGGAGAGGC GCAT TGC GGAAAT GGGGCGC CC CATC 2115 
Qy 1960 

TT GGGT GAACAC C CCAAACT AGAAGTC ATC AT T GAAGAGT CCT AT GAGTT CAAGACT ACG 2019 

I II II III I I I I II II II I I I I I I I I I I I I I II II I I I I I II 

I I 

Db 2116 

CTAGGCGAGCACACCAAGCTGGAGGTGATCATCGAAGAGTCTTACGAATTCAAGAGCACT 2175 
Qy 2020 

GT GGACAAACT GAT CAAGAAGACAAAC CT GGCCT T GGTTGT GGGGACC CATTCCT GGAGG 2079 
I II I I I I I II I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

I I I I I I 

Db 2176 

GT GGACAAACT CATTAAGAAGAC GAAC CTGGC C CTT GT GGT GGGGAC CAAC AGCT GGAGA 2235 
Qy 2080 

GACCAGTT CAT GGAGGCCAT CACC GT CAGT GC AGCAGGGGAT GAGGAT GAGGAT GAAT CC 2139 

I I I I II I II I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2236 

GAGC AGT T TAT C GAAGCCAT C ACT GT CAGCGCT GGGGAAGAT GAC GAT GAT GAT GAAT GT 2295 
Qy 2140 

GGGGAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGG 2199 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II 

I I I I I I 

Db 2296 

GGGGAGGAGAAGCTGCCCTCCTGTTTTGATTACGTGATGCACTTTCTCACAGTGTTCTGG 2355 



Qy 2200 



AAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCC 2259 

I I I I I I I I I II I I I I II II II I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 2356 

AAGGTTCTGTTTGCCTTCGTCCCACCTACAGAATACTGGAATGGCTGGGCCTGCTTCATT 2415 
Qy 2260 

GTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTC 2319 

I I I I I I I I I I I I I I II III I II I I I I I I I I I I II I I I I I I I I I II 

Mill 

Db 2416 

GTCTCCATCCTCATGATCGGCCTACTGACCGCCTTCATTGGAGACCTGGCTTCCCACTTT 2475 
Qy 2320 

GGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACC 2379 

I I I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I I II II II I I I I 

I I I 

Db 2476 

GGCTGCACCATTGGTCTGAAAGATTCCGTGACTGCCGTTGTGTTTGTTGCTCTTGGAACC 2535 
Qy 2380 

T CT GT C CCAGAT ACGT TT GCCAGCAAAGCT GCT GC CCT CC AGGAT GT AT AT GCAGAC GCC 2439 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

I I 

Db 2536 

TCGGTGCCAGACACATTTGCCAGCAAAGTAGCAGCTACCCAGGACCAGTATGCAGATGCG 2595 
Qy 2440 

TCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCC 2499 

II II II I I I II I I II I I I I I II II I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 2596 

T CT AT AGGCAAT GTCACT GGAAGCAAT GCT GT GAAT GT CTT C CT GGGAAT CGGCGT GGCC 2655 
Qy 2500 

TGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGC 2559 

I I I I I I I I I I I I I I I I I I II III I I I I I I I I I I I I 

I I II I 

Db 2656 

TGGTCCATTGCTGCCATCTACCATGCGGCCAACGGGGAACAGTTCAAAGTGTCCCCCGGC 2715 
Qy 2560 

ACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTC 2619 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I 

I I I I I 

Db 2716 

ACGCTAGCTTTCTCTGTCACTCTCTTCACTATTTTTGCTTTCATCAACGTGGGGGTGCTG 2775 
Qy 2620 

TTGTACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTC 2679 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I 

Db 2776 

CTGTATCGGCGGAGGCCAGAAATAGGAGGTGAGCTGGGAGGGCCCCGGACTGCCAAGCTC 2835 
Qy 2680 

GCC ACAACAT GGCTCT TT GT GAGC CTGT GGCT C CT CT AC AT ACT CT TTGC CAC ACT AGAG 2739 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2836 

CTCACATCTTCCCTGTTTGTGCTCCTGTGGCTCTTGTACATTTTCTTCTCCTCCCTGGAA 2895 



Qy 2740 GCCTATTGCTACATCAAGGGGTTCTAA 2766 

I I I I I III I I I I I I I I I I II I I I 
Db 2896 GCCTACTGCCACATAAAGGGCTTCTAA 2922 



linear GSS 12- 



RESULT 9 
AY408693 

LOCUS AY408693 2516 bp DNA 

DEC-2003 

DEFINITION Homo sapiens HCM3309 gene, VIRTUAL TRANSCRIPT, partial 
sequence, 

genomic survey sequence. 
ACCESSION AY408693 

VERSION AY408693.1 GI:39764664 

KEYWORDS GSS. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 2516) 

AUTHORS Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., 
Kejariwal, A. , 

Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B. , 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams ,M.D. and Car gill, M. 
TITLE Inferring nonneutral evolution from human- chimp-mouse 

orthologous 

gene trios 

JOURNAL Science 302 (5652), 1960-1963 (2003) 
PUBMED 14671302 
REFERENCE 2 (bases 1 to 2516) 

AUTHORS Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., 
Kejariwal, A. , 

Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
TITLE Direct Submission 

JOURNAL Submitted ( 16-NOV-2003 ) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence was made by sequencing genomic exons and 



COMMENT 
ordering 

FEATURES 

source 



gene 
ORIGIN 



them based on alignment. 

Location/Qualifiers 
1. .2516 

/organism="Homo sapiens" 
/ mo l__type=" genomic DNA" 
/db_xref="taxon: 9606" 
<1. .>2516 
/locus_jtag="HCM3309" 



Query Match 41.6%; Score 1151.4; DB 9; Length 2516; 

Best Local Similarity 66.0%; Pred. No. 6e-308; 

Matches 1673; Conservative 0; Mismatches 809; Indels 51; 
Gaps 5 ; 



Qy 266 

T T GGGGT GT C CAT C ATT GCT GAC C GCTT CAT GGCAT CT ATT GAAGT C ATCACCT CT CAAG 325 



I II 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 I II II 1 1 1 1 1 1 1 1 II 

I 1 1 

Db 1 

T GGGAGT GT CCAT C ATCGCC GACC GTTT C AT GGCGGCCAT CGAGGT CATCACGT CAAAAG 60 
Qy 326 

AGAGGGAGGT GACAATTAAGAAAC CCAAT GGAGAAACCAGCACAACCACT ATT C GGGT CT 385 

II I I I I I I I I I I I II I I I I I I I I I I I I I I III I I I I 

I I I 

Db 61 

AGAAGGAGAT CACC ATCACCAAGGCCAAC GGT GAGACCAGCGT GGGCACCGTT CGCAT CT 120 



Qy 386 

GGAATGAAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCCTCTGCTCCTGAGATAC 445 

II I I I II II II I I I I I I I I II II I I II II I II I I I I I II II I I I I I I I I 

I 

Db 121 

GGAAT GAGAC GGT GT CCAAC CT C ACGCT CAT GGC C CT GGGCT C CT C C GCACCT GAGAT CC 180 



Qy 446 

TCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGATCTGGGACCTTCTA 505 

Mill I I I II I I I I I I I I III I I I I I I I I I I I I I I 

I 

Db 181 

TGCTGTCAGTCATCGAAGTCTGCGGCCACAACTTCCAGGCGGGTGAGCTGGGCCCAGGCA 240 



Qy 506 

CCATT GT AGGGAGT GCAGC CT T CAACAT GT T CAT CAT CATT GGC AT CT GT GT CT ACGT GA 565 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I 

I 

Db 241 

CCATCGTGGGCAGCGCTGCCTTCAACATGTTTGTGGTCATCGCCGTGTGCATCTACGTCA 300 



Qy 566 

T CCCAGACGGAGAGACT CGCAAGAT CAAGCAT CT ACGAGT CTT CT TCAT CACC GCT GCTT 62 5 

I I I I I I III I I II I I I I I I I II I I I I I II I I I I I I I I I I I II I II 

I I I 

Db 301 

TCCCAGCCGGCGAGAGCCGCAAGATCAAGCACCTGAGAGT CTT CTTTGTCACT GCCT CTT 360 
Qy 626 

GGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGG 685 

II I I I I I I I I I I I I I I II U I III I II II II II II I I I I I 

I I I I I I I 

Db 361 

GGAGCATCTTCGCCTATGTCTGGCTTTATCTCATCCTTGCTGTTTTTTCCCCCGGTGTGG 420 



Qy 686 

TCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCT^ '745 

I I I I I I I I I I I I I II II M II I I I I II I II I I I I I I I I I 

I I I I 

Db 421 

TCCAGGTGTGGGAGGCGCTGCTGACCCTGGTCTTCTTCCCGGTGTGCGTGGTATTCGCCT 480 
Qy 746 

GGGT GGC AGATAAAC GACT GCT CTT CT ACAAAT ACAT GCACAAAAAGT ACC GCACAGACA 805 

II I I I I II II II I I I I I I I I I I I I I I II I II I I I I I I I I I I II 

I I I 

Db 481 

GGATGGCCGACAAGCGGCTGCTCTTCTACAAGTACGTGTACAAGCGCTACCGCACCGACC 540 



Qy 806 

AACACCGAGGAATT ATCATAGAGACAGAGGGT GAC CACCCTAAGGGCATT GAGAT GGATG 865 

II I I II II I I I I I I I I I I I I I I I I I III III I I I I III I I I I 

I 

Db 541 

CACGC AGC GGC AT CATCAT AGGC GCCGAGGGCGAC C C CCCGAAGAGCATC GAGCT GGACG 600 

Qy 866 GGAAAAT GAT GAATTCC C ATT TT CTAGAT GGGAA 

CCTGGTGCCCCTGG 913 

II III III I I I II I I I I I I I I 

I 

Db 601 

GCACGTTCGTGGGCGCCGAGGCCCCAGGTGAGCTGGGCGGCCTGGGCCCGGGCCCCGCCG 660 
Qy 914 

AAGGGAAGGAAGT GGAT GAGT C CCGCAGAGAGAT GAT C CGGAT T CT CAAGGAT CT GAAGC 973 

III II I I I I I I I I I I I I I I II I I I II I I I I I I I I I I 

I I I I 

Db 661 

AGGCGCGCGAGCTGGACGCCAGCCGCCGCGAGGTCATCCAGATCCTCAAGGACCTCAAGC 720 
Qy 974 

AAAAACAC C CAGAGAAGGACT T AGAT C AGCT GGT GGAGAT GGC CAAT TACT AT GCT CTTT 1033 

I II I I I I I II I I I I I I II I I I I I I I I I I II I II I I I I I I I II II 

Db 721 

AGAAGC ACCC GGACAAGGAT CT GGAGCAGCT GGT GGGCAT CGC CAACT ACT AC GC GCT GC 780 
Qy 1034 

C CCAC CAACAGAAGAGCCGC GC CT T CT ACC GT AT C CAAGC C ACT CGT ATGAT GACT GGT G 1093 

Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill II II I I II I II 

I 

Db 781 

T GCACCAGCAGAAGAGC CGC GC CTT CT ACC GCAT C CAGGC CACGC GGCTGATGACCGGCG 840 
Qy 1094 

CAGGCAAT ATCCTGAAGAAACAT GCAGCAGAACAAGCCAAGAAGGCCT CCAGCATGAGCG 1153 

I II I I I I II I I I I I I I I II I Mil l I 

I I 

Db 841 CCGGGAACGTGCTGCGCAGACACGCGGCGGACGCCTCGCGCAGGGCGGC 

GCCGGCCG 897 

Qy 1154 

AGGT GCACAC CGAT GAGCCT GAGGACTT T ATTT CCAAGGT CTT CTTT GACCCAT GTT CTT 1213 

Ml Mill Mill I M II M I II M I 

I 

Db 898 

AGGGCGCGGGCGAGGACGAAGACGACGGCGCCAGCCGCATCTTCTTCGAGCCTAGCCTCT 957 
Qy 1214 

ACCAGT GC CT GGAGAACT GT GGGGCT GT ACT CCT GACAGT GGT GAGGAAAGGGGGAGACA 1273 

II II M II M II II I I I I II I I I I I M I I II II I M I I I I 

Db 958 

ACCACTGCCTGGAGAACTGCGGCTCCGTGCTGCTGTCCGTCACGTGCCAGGGCGGCGAGG 1017 
Qy 1274 

T GT CAAAGACCAT GTAT GTGGACTACAAAACAGAGGATGGTT CTGCCAATGCAGGGGCT G 1333 

I I M I I I II I II II I I II I M II I I II I II I II I I M I 

I 

Db 1018 



GCAACAGCACCTTCTACGTGGACTACCGCACTGAGGACGGCTCTGCCAAGGCGGGCTCCG 1077 



Qy 1334 

ACT AT GAGTT C ACAGAGGGCAC GGT GGT T CTGAAGCCAGGAGAGACCCAGAAGGAGTT CT 1393 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1078 

ACT AC GAGT ACAGNGAGGGCAC GCT GGT GT T CAAACCAGGCGAGACGCAGAAGGAGCT GC 1137 
Qy 1394 

C CGT GGGC AT AAT T GAT GAC GACATTTTT GAGGAGGAT GAACACT T CTT TGTAAGGT T GA 1453 

I I I I I I I II I I II I I I I I I II I I I I I I I I II II I I I I I II II 

I I 

Db 1138 

GCATCGGCATCATCGACGACGACATCTTCGAGGAGGACGAGCATTTCTTCGTGCGGCTGC 1197 
Qy 1454 

GCAAT GT CCGCAT AGAGGAGGAGCAGCCAGAGGAGGGGAT GCCT CC AGCAAT AT T CAACA 1513 
II I I I I I I I I I II I I I II I 

I I 

Db 1198 TGAACCTGCGCGTGGGCGACGCGCAGGGCATGTTCGAGCCG 

GACG 1242 

Qy 1514 

GTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGG 1573 

I III II III III I I I I I I I II I I I I I I 

I I I 

Db 1243 

GCGGCGGGCGGCCCAAGGGGCGGCTGGTGGCGCCGCTGCTGGCCACCGTCACCATCCTGG 1302 
Qy 1574 

AT GAT GAC CAT GCAGGC AT CTT CACTTTT GAAT GT GAT ACT ATT CAT GT CAGT GAGAGTA 1633 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1303 

AC GAC GAC C ACGCAGGC AT CTT CT CCTT C C AGGACC GC CT GCT GCACGT GAGCGAGT GCA 1362 
Qy 1634 

T T GGT GT T AT GGAGGT CAAGGT T CT GCGGAC AT CAGGT GCC CGGGGT ACAGT CAT CGT CC 1693 

III I I I I I I II I I I I I I I I I I I I I I I I I I I II 

I 

Db 1363 

TGGGCACCGTGGACGTGCGCGTCGTGCGCAGCTCGGGCGCGCGCGGCACCGTGCGCCTTC 1422 
Qy 1694 

CCTT T AGGACAGT AGAAGGGACAGC CAAGGGT GGC GGT GAGGACTT TGAAGACACATATG 1753 

Ml I I I I I I I I I I I II I I I I II I I I II I I I I I I I I 

I 

Db 1423 

CCTACCGCACGGTGGACGGCACGGCGCGCGGCGGCGGCGTGCACTACGAGGACGCGTGCG 1482 
Qy 1754 

GGGAGTT GGAATT CAAGAAT GAT GAAACT GT GAAAAC CATAAGGGTT AAAAT AGT AGAT G 1813 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

I II I 

Db 1483 

GAGAGCTGGAGTTT GGCGACGACGAGACCATGAAAACT CTTCAGGTGAAGATAGTTGATG 1542 
Qy 1814 

AGGAGGAAT ACGAAAGGCAAGAGAATTT CT T CAT T GCC CTTGGTGAAC CGAAAT GGAT GG 1873 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I 



Db 1543 

ACGAGGAATATGAGAAAAAGGATAATTTCTTCATTGAGCTGGGCCAGCCCCAGTGGCTTA 1602 



Qy 1874 AAC GT GGAAT AT C AGAT GT GACA 

GACAGGAAGCT GACT AT GG 1915 

I I I I I I I I I I I I I I I II I II I I I I 

I 

Db 1603 

AGCGAGGGATTTCAGNNNNNNNNNNNNNNNNNNGGGATGGGGACAGGAAGCTAACAGCCG 1662 
Qy 1916 

AAGAAGAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTGGGTGAACACCCCA 1975 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II 

I 

Db 1663 

AGGAGGAGGAGGCT CGGAGGAT AGCAGAGAT GGGCAAGC C AGT T CT T GGGGAGAACT GCC 1722 



Qy 1976 

AACTAGAAGTCATCATT GAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACT GATCA 2035 
II II I I I I I I I I II I I I I I I I I I I II III 

Db 1723 

GGCT GGAGGT CAT CAT C GAGGAGTCAT AT GATTTTAAGNNNNNNNNNNNNNNNNNNNNNN 1782 



Qy 2036 

AGAAGACAAAC CT GGCCTT GGTT GT GGGGAC C CATTC CT GGAGGGACC AGTT CAT GGAGG 2095 



Db 1783 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 1842 



Qy 2096 CCATCACCGTCAGTGCA 

GCAGGGGAT GAGGATGAGGATGAAT C C GGGGAGGAGAGGC 2152 

I I I I I I I I I I I I II I I I I I I I I I I I I I 

I I I 

Db 1843 

NNNNNNNNNNNNNNNNNNGG GAC GAGGAGGAGGAGGAGGACGGGT C CCGGGAGGAGC GGC 1902 



Qy 2153 

TGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTG 2212 

I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M 

I 

Db 1903 

TGCCGTCGTGCTTTGACTACGTGATGCACTTCCTGACGGTGTTCTGGAAGGTGCTCTTCG 1962 



Qy 2213 

CCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCA 2272 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1963 

CCTGTGTGCCCCCCACCGAGTACTGCCACGGCTGGGCCTGCTTTGGTGTCTCCATCCTGG 2022 
Qy 2273 

TCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTG 2332 

I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2023 

TCATCGGCCTGCTCACCGCCCTCATTGGGGACCTCGCCTCCCACTTCGGCTGCACCGTTG 2082 



Qy 2333 

GTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATA 2392 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 



Db 2083 

GCCTCAAGGACTCTGTCAATGCTGTTGTCTTCGTTGCCCTGGGCACCTCCATCCCTGACA 2142 



Qy 2393 

C GTTT GCCAGCAAAGCT GCT GCC CT C CAGGATGT AT AT GCAGACGCCT C CAT T GGCAACG 2452 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I 

Db 2143 

CGTTCGCCAGCAAGGTGGCGGCGCTGCAGGACCAGTGCGCCGACGCGTCCATCGGCAACG 22 02 



Qy 2453 

TGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCG 2512 
I I I I III I I I I I I II II II I I I I I II I I III I I I I I II I I 

I I I I I I I 

Db 2203 

TGACCGGCTCCAACGCGGTGAACGTGTTCCTTGGCCTGGGCGTCGCCTGGTCTGTGGCCG 2262 



Qy 2513 

CCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCT 2572 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I I I 

Db 2263 

CCGTGTACTGGGCGGTGCAGGGCCGCCCCTTCGAGGTGCGCACTGGCACGCTGGCCTTCT 2322 



Qy 2573 

CCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGC 2632 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2323 

CCGTCACGCTCTTCACCGTCTTCGCCTTCGTGGGCATTGCCGTGCTGCTGTACCGGCGCC 2382 



Qy 2633 

GGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGC 2692 

I I I I I I I I I II II I I I I I II I I II I II II I I I I I I I I I I I I II 

I I 

Db 2383 

GGCCGCACATCGGCGGCGAGCTGGGCGGCCCGCGCGGACCCAAGCTCGCCACCACCGCGC 2442 



Qy 2693 

TCTTT GT GAGCCT GTG GCT C CTCT ACAT ACT CTTT GC CACACTAGAGGCCT AT T GCTACA 2752 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2443 

TCTTCCTGGGCCTCTGGCTCCTGTACATCCTCTTCGCCAGCCTGGAGGCGTACTGCCACA 2502 

Qy 2753 TCAAGGGGTTCTA 2765 

II I I I I I I I I 

Db 2503 TCCGGGGCTTCTA 2515 



AY398962 



2881 bp 



DNA 



linear GSS 15- 



RESULT 10 
AY398962 
LOCUS 
DEC-2003 

DEFINITION Pan troglodytes SLC8A1 gene, VIRTUAL TRANSCRIPT, partial 
sequence, 

genomic survey sequence. 
ACCESSION AY398962 

VERSION AY398962.1 GI: 39754951 

KEYWORDS GSS. 



SOURCE Pan troglodytes (chimpanzee) 

ORGANISM Pan troglodytes 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 
REFERENCE 1 (bases 1 to 2881) 

AUTHORS Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P . , 
Kejariwal, A. , 

Todd, M. A., Tanenbaum, D.M. , Civello, D . R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
TITLE Inferring nonneutral evolution from human- chimp-mouse 

orthologous 

gene trios 

JOURNAL Science 302 (5652), 1960-1963 (2003) 
PUBMED 14671302 
REFERENCE 2 (bases 1 to 2881) 

AUTHORS Clark, A. G. , Glanowski, S . , Nielson,R., Thomas, P., 
Kejariwal, A. , 

Todd, M. A., Tanenbaum, D.M. , Civello, D . R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
TITLE Direct Submission 

JOURNAL Submitted ( 16-NOV-2003) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence was made by sequencing genomic exons and 



COMMENT 
ordering 

FEATURES 

source 



gene 



ORIGIN 



them based on alignment. 

Location/Qualifiers 
1. .2881 

/organism="Pan troglodytes" 

/mol_type=" genomic DNA" 

/db_xref="taxon: 9598" 

<1. ,>2881 

/gene="SLC8Al" 

/locus_tag="HCM0065 M 



Query Match 35.4%; Score 978.4; DB 9; Length 2881; 

Best Local Similarity 56.4%; Pred. No. 8e-260; 

Matches 1587; Conservative 0; Mismatches 1057; Indels 171; 
Gaps 5; 



Qy 109 

GAC GT GCCAAGCACAGGGC AGAACAAT GAGT CCT GTT CAGGGT CAT CGGACT GCAAGGAG 168 

II II II III I I I I I I I I I I I I II I I I I I I I I 

I 

Db 76 

GAAAT GGAAGGAGAAGGAAAT GAAACTGGT GAAT GT ACT GGAT CAT AT TACT GTAAGAAA 135 



Qy 169 

GGT GT CAT C CT GCCAAT CT GGT ACCC GGAGAACCCTT C C CT T GGGGACAAGATT GCCAGG 228 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 136 

GGGGTGATTTTGCCCATTTGGGAACCCCANNNNNCTTCTTTTGGGGACAAAATTGCTNNN 195 



Qy 229 

GTCATTGTCTATTTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGAC 288 
I I I I I I I I I I I I I I I I I II I I I I 

Db 196 



GCTACTGTGTATTTTGTGGCCATGGTCTACNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 255 



Qy 289 

C GCT T C ATGGCAT CTATT GAAGTCAT CAC CT CTCAAGAGAGGGAGGT GACAATTAAGAAA 348 
Db 256 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 315 
Qy 349 

CC CAAT GGAGAAAC CAGCACAAC CAC TAT T C GGGT CT GGAAT GAAACTGT CT C CAAC CT G 408 
Db 316 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 375 
Qy 409 

ACCCTTATGGCCCTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGT 468 
Db 376 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 435 
Qy 469 

GGTCATGGGTTCATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTC 528 

I I I I II II I I I I I II 

I I I 

Db 436 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN C CAT C GT GGGAAGT GCT GCATT C 495 
Qy 529 

AACATGTTCATCATCATT GGCATCTGTGT CTACGT GAT CCCAGACGGAGAGACTCGCAAG 588 

II I I I I I I I I I II I I I I II I I I I I II Ml I I I I I I I I I I I I I I I I 

I I I 

Db -496 

AAT ATGTT C AT CATT ATT GCACTCT GT GT TT AT GT GGT C C CT GACGGAGAGACAAGGAAG 555 
Qy 589 

AT CAAGCAT CT AC GAGT CTT CT TCATCAC C GCT GCT T GGAGT AT CT TT GC CT ACAT CT GG 648 



Db 556 

ATTAAGCATTTGCGTGTGTTCTTTGTGACAGCAGCCTGGAGCATCTTTGCCTACACCTGG 615 
Qy 649 

CTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTC 708 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I 

I I 

Db 616 

CTTTACATTATTTTGTCTGTCATATCTCCTGGTGTTGTGGAGGTCTGGGAAGGTTTGCTT 675 
Qy 709 

ACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTC 768 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

I I 

Db 676 

ACTTTCTTCTTCTTTCCCATCTGTGTTGTGTTCGCTTGGGTAGCGGATAGGAGACTTCTG 735 
Qy 769 

TT CT ACAAAT AC ATGCACAAAAAGT AC C GC AC AGACAAAC AC C GAGGAAT TAT C AT AGAG 828 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 736 



TTTTACAAGTATGTCTACAAGAGGTATCGAGCTGGCAAGCAGAGGGGGATGATTATTGAA 795 



Qy 829 ACAGAGGGT GACCACCC 

TAAGGGCATTGAGATGGATGGGAAAAT GAT GAAT 879 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 796 

CAT GAAGGAGAC AGGCCAT CT T CT AAGACT GAAATTGAAAT GGAC GGGAAAGT GGT CAAT 855 



Qy 880 

TC CCATTTT CT AGATGGGAAC CT GGT GC CC CT GGAAGGGAAG 921 

I I I I I I I I I I I I II I I III 

Db 856 

T CT CAT GTT GAAAATTT CTT AGAT GGT GCT CT GGT TCT GGAGGT GGAT GAGAGGGACCAA 915 



Qy 922 

GAAGTGGAT GAGTCCCGCAGAGAGAT GATCCGGATTCTCAAGGATCT GAAGCAAAAACAC 981 

III MM I I I I M I II I I I I II I I I I M I I I I I M I I 

I I 

Db 916 

GAT GAT GAAGAAGCTAGGC GAGAAAT GGCT AGGAT TCT GAAGGAACT TAAGCAGAAGCAT 975 



Qy 982 

CCAGAGAAGGACTTAGATCAGCT GGT GGAGATGGCCAATTACTAT GCT CTTTCCCACCAA 1041 
I I I I I I I II M II I I I III I I I I I I I I I I II 

I I 

Db 976 

CCAGATAAAGAAATAGAGCAATTAATAGAATTAGCTAACTACCAAGTCCTAAGTCAGCAG 1035 



Qy 1042 

CAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGTAT GAT GACTGGT GCAGGCAAT 1101 

II II I I I I I M II I I I I II II I I II II I I I I I I I I I I I 

I I II I 

Db 1036 

CAAAAAAGTAGAGCATTTTATCGCATTCAAGCT ACTCGCCT CAT GACTGGAGCT GGCAAC 1095 
Qy 1102 

AT CCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCCT CCAGCATGAGCGAGGTGCAC 1161 

II I I II I I I II I I I I I I I II I I I I I II I I II I I I I I I M I I I 

I I 

Db 1096 

AT TTTAAAGAGGCATGCAGCT GAC CAAGCAAGGAAGGCT GT C AGCAT GCAC GAGGT CAAC 1155 



Qy 1162 ACCGATGAGCCTG 

AGGACTTTATTT CCAAGGT CTTCTTT GACC CAT GTTCT T AC CAG 1218 

I I I I I I I II I I I I III I I I I I I I I I M I I I I I III 

I I I 

Db 1156 

ACT GAAGTGACT GAAAATGACCCT GTTAGTAAGATCTTCTTT GAACAAGGGACATATCAG 1215 
Qy 1219 

T GC CT GGAGAACT GT GGGGCT GT ACT C CTGACAGT GGT GAGGAAAGGGGGAGAC AT GT CA 1278 

I I I I I II I I I II II I I I II I I II II I I I I II I I I I I II 

I 

Db 1216 

T GT CT GGAGAACT GT GGT ACT GT GGCC CTT AC C ATT AT C CGCAGAGGT GGT GAT T T GACT 1275 



Qy 1279 

AAGAC CAT GT AT GT GGACT ACAAAACAGAGGAT G GT T CT GCCAAT GC AGGGGCT GACT AT 1338 

I I I I II I I M I II I I I I I II I I I II I II I II I I II I II I I I I I 



Db 1276 

AAC ACT GTGTTT GTT GACTT CAGAAC AGAGGAT GGCAC AGCAAATGCTGGGT CT GATT AT 1335 
Qy 1339 

GAGTTCACAGAGGGCACGGT GGTTCTGAAGCCAGGAGAGACCCAGAAGGAGTTCTCCGT G 1398 

I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 1336 

GAAT TT ACT GAAGGAACT GT GGT GTT TAAGC CTGGTGAGACCCAGAAGGAAAT C AGAGT G 1395 
Qy 1399 

GGCATAATT GAT GAC GAC ATTT TT GAGGAGGAT GAACACTT CT TT GTAAGGT T GAGCAAT 1458 

II II II I I I II II II I I I I I I I I I I I I I I I I III I I I I I 

I I I I I I 

Db 1396 

GGT AT C AT AGAT GAT GAT AT CT TT GAGGAGGAT GAAAATTT CCTT GT GCAT CT CAGCAAT 1455 
Qy 1459 

GTC C GC ATAGAGGAGGAGCAGC CAGAGGAGGGGAT GCCTC CAGCAAT ATT CAACAGT CTT 1518 

I I I II II I I I I I I I I I I I III I I I I 

I 

Db 1456 GT CAAAGT AT CT T CT GAAGCTT CAGAAGAT GGCAT ACT GGAAGC 

CAATCAT 1506 

Qy 1519 

CCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGATGAT 1578 

I II III I I I I I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 1507 

GTTTCTACACTTGCTTGCCTCGGATCTCCCNNCACTGCCACTGTAACTATTTTTGATGAT 1566 
Qy 1579 

GACCAT GCAGGCAT CTT CACTT T T GAAT GT GAT ACTAT TCAT GT C AGT GAGAGT ATT GGT 1638 

I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I 

I I I I I 

Db 1567 

GACCACGCAGGCATTTTT ACTTTT GAGGAACCTGTGACTCATGT GAGTGAGAGCATT GGC 1626 
Qy 1639 

GTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGTACAGTCATCGTCCCCTTT 1698 

I I I I I I I I I I I I I II I Mill !i II II II I I I I I I I I I I I 

I 

Db 1627 

ATCATGGAGGTGAAAGTATTGAGAACATCTGGAGCTCGAGGAAATGTTATCGTTCCATAT 1686 
Qy 1699 

AGGACAGTAGAAGGGACAGCCAAGGGT GGC GGT GAGGACTT T GAAGACACAT AT GGGGAG 1758 

" I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I III I I I I 

I I I 

Db 1687 

AAAACCATCGAAGGGACTGCCAGAGGT GGAGGNGAGGATTTTGANNACACTT GTGGAGAG 1746 
Qy 1759 

T T GGAATT CAAGAAT GAT GAAACT GT GAAAAC CATAAGGGTTAAAAT AGT AGAT GAGGAG 1818 
I I I I I I I 

Db 1747 

CT CGAAT T C CNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 1806 



Qy 1819 



GAAT AC GAAAGGCAAGAGAATTT CT T C ATT GCC CTTGGTGAAC C GAAAT GGATGGAACGT 1878 



Db 1807 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 1866 
Qy 1879 

GGAAT AT C AGAT GT GAC AG AC AGGAAG 1905 

Db 1867 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 1926 
Qy 1906 

1905 

Db 1927 

NNNNNNN GC CAAC CT GT CT T C AGGAAGGTT CAT GCT AGAGAACAT C C GAT T CT CT CT ACT 1986 

Qy 1906 

CT GACT AT GGAAGAAGAGGAG 1926 

I I I I I I I I I I II 

I I I 

Db 1987 

GTAATCAC C ATT GCAGNNGAATAT GAT GACAAGCAGCCACT GACCAGCAAAGAGGAAGAG 2046 
Qy 1927 

GC CAAGAGGATAGCAGAGAT GGGAAAGC CAGT AT T GGGTGAACAC C C CAAACT AGAAGT C 1986 

I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I 

I I I II 

Db 2047 

GAGAGGC GC ATT GCAGAAAT GGGGC GC C C CAT C CT GGGAGAGCACAC CAAGTT GGAAGT G 2106 
Qy 1987 

ATCATTGAAGAGT CCTAT GAGTTCAAGACTACGGTGGACAAACT GATCAAGAAGACAAAC 2046 
I I I I I I II I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I II 

I I I I I I I I I I I I 
Db 2107 

AT CATT GAAGAAT CCTAT GAAT T CAAGAGT ACT GT GGACAAACT CATTAAGAAGACAAAC 2166 
Qy 2047 

CTGGC CTT GGTT GT GGGGACC C ATT CCT GGAGGGACCAGTT CAT GGAGGC CAT CAC CGT C 2106 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

I I I 

Db 2167 

CTGGCCCTT GTGGTT GGGACT AACAGCT GGAGAGAAC AGTT CATTGAAGCTAT CACTGT C 2226 
Qy 2107 

AGT GCAGCAGGGGAT GAGGAT GAGGAT GAAT C CGGGGAGGAGAGGCT GCCCT CCT GCTTT 2166 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

I I 

Db 2227 

AGT GCT GGGGAAGAT GAT GAC GAC GAT GAAT GT GGGGAAGAGAAGCT GCCCT C CT GT TT C 2286 
Qy 2167 

GACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTGCCCCCC 2226 

II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I I I I 

Db 2287 

GATTACGTGATGCACTTTCTGACTGTGTTCTGGAAGGTCTTGTTTGCCTTCGTCCCCCCT 2346 



Qy 2227 



ACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTC 2286 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2347 

ACTGAATACTGGAATGGCTGGGCGTGTTTCATTGTCTCCATCCTCATGATTGGCCTACTG 2406 
Qy 2287 

ACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGATTCA 2346 

II II I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I II 

I I I I I I I I 

Db 2407 

ACAGCTTTCATTGGAGACCTGGCTTCCCACTTTGGCTGCACCATTGGCCTGAAAGATTCT 2466 
Qy 2347 

GTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAA 2406 
II II II II II I I I I I III I I I I II II II I I I I I II 

I I I I I I I I I I I I 

Db 2467 

GTGACTGCAGTCGTGTTCGTCGCACTTGGAACATCAGTGCCAGACACATTTGCCAGCAAA 2526 
Qy 2407 

GCT GCT GCC CTCCAGGAT GT AT AT GCAGAC GC CT CCAT TGGCAACGT GACGGGC AGCAAC 2466 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I I I I I 

Db 2527 

GT GGCAGCCACCCAGGACCAGTATGCAGACGCCTCCATAGGTAACGT CACGGGCAGCAAC 2586 
Qy 2467 

GCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTACTGGGCT 2526 

II II II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2587 

GCGGTGAACGTCTTCCTGGGAATCGGTGTGGCCTGGTCCATCGCTGCCATCTACCACGCA 2646 
Qy 2527 

CTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACCCTCTTC 2586 



Db 2647 

GCCAACGGGGAACAGTTCAAAGTGTCCCCTGGCACACTAGCTTTCTCTGTCACTCTCTTC 2706 
Qy 2587 

ACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCACCTGGGA 2646 

I I I I I II I I I I I II I I I I I I I I I I I I I II II I 

I 

Db 2707 

ACCATTTTTGCTTTCATCAATGTGGGGGTGCTGCTGTATCGGCGGANNCCAGAAANNNGN 2766 
Qy 2647 

GGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTGAGCCTG 2706 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2767 

GGTNNNNNGGGTNGGCCCCGGACTGCCAAGCTCCTCACATNCTGCCTCTTTGTGCTCCTA 2826 

Qy 2707 T GGCT CCT CT ACAT ACT CT TT GCCACACT AGAGGC CT AT TG CT ACAT CAAGGGGT 

2761 

I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I 
Db 2827 TGGCTCTTGTACATTTTCTTCTCCTCCNTGGAGGCCTACTGCCACATAAAAGGCT 

2881 



RESULT 11 
CNSLT1IBJ 
LOCUS 
JUN-2003 
DEFINITION 



CNSLT1IBJ 



1589 bp mRNA linear HTC 18- 



human full-length cDNA 5-PRIME end of clone CS0DB006YD18 of 
Neuroblastoma of Homo sapiens (human) . 
BX248763 

BX248763.1 GI: 28375580 
HTC. 

Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1589) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 

Contact : Feng Liang Email : fliang@lifetech.com URL : 
http://fulllength.invitrogen.com/ InVitroGen Corporation 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
REMARK 



1600 

REFERENCE 

AUTHORS 

TITLE 

JOURNAL 
Sequencage : 

BP 191 91006 EVRY cedex - FRANCE (E-mail : 
seqref ©genoscope . ens . f r 

- Web : www.genoscope.cns.fr) 

1st strand cDNA was primed with a Notl-oligo (dT) primer. 



Faraday Avenue 

2 (bases 1 to 1589) 

Genoscope . 

Direct Submission 

Submitted ( 13-FEB-2003 ) Genoscope - Centre National de 



COMMENT 
Five prime 

cloned 

Library 

was normalized. Library was constructed by Life 
Technologies , a 

division of Invitrogen. 



end enriched, double-strand cDNA was digested with Not I and 
into the Not I and Eco RV sites of the pCMVSPORT 6 vector. 



FEATURES 

source 



pCMVSP0RT_6 r 
CDS 



Location/Qualifiers 
1. -1589 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="CS0DB006YD18" 
/tissue_type="Neuroblastoma " 

/note="end : 5-PRIME~Cot 10-normalized-vector 
619. .>1589 

/note="unnamed protein product" 
/codon_start=l 
/protein_id="CAD6657 0 . 1 " 
/db xref="GI:28375581" 



/ trans la tion="MAWLRLQPLTSAFLHFGLVTFVXFLNGLRAEAGGSGDVPSTGQN 



NESCSGSSDCKEGVILPIWYPENPSLGDKIARVIVYFVALIYMFLGVSIIADRFMASI 



EVITSQEREVTIKKPNGETSTTTIRVWNETVSNLTLMALGSSAPEILLSLIEVCGHGF 

IAGDLGPSTIVGSAAFNMFI I IGICVYVI PDGETRKI KHLRVFFITAAWSI FAYIWLY 

MI LAVFS PGWQWEGLLTLFFFPVCVLLAWVADKRLLFYKYMHKKYRTDKHRGI HE 

TEGDHPKGIEMDGKMMNSHFLDGNLVPLEGKEVDESRREMIRILKDL" 

ORIGIN 

Query Match 35.1%; Score 971; DB 3; Length 1589; 

Best Local Similarity 100.0%; Pred. No. 7.2e-258; 
Matches 971; Conservative 0; Mismatches 0; Indels 0; 
Gaps 0; 

Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 



Db 619 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 67 8 
Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 679 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 738 
Qy 121 

ACAGGGCAGAACAATGAGTCCTGTTCAGGGT CAT CGGACTGCAAGGAGGGTGTCAT CCTG 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 739 

AC AGGGC AGAACAATGAGT C CTGTT CAGGGT CAT CGGACT GCAAGGAGGGT GT CAT CCTG 798 
Qy 181 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 799 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 858 
Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 859 

TTT GT GGCC CTGATAT ACAT GTT CCTT GGGGTGT CCAT CATT GCT GACC GCTT CAT GGCA 918 
Qy 301 

TCTATTGAAGTCAT CACCT CTCAAGAGAGGGAGGT GACAATTAAGAAACCCAATGGAGAA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I 
Db 919 

TCTATT GAAGT CATCACCT CTCAAGAGAGGGAGGT GACAATTAAGAAACCCAATGGAGAA 978 
Qy 361 

AC C AGCACAAC C ACT ATT CGGGT CT GGAAT GAAACTGT CT CCAAC CT GACC CT TAT GGCC 420 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 979 



AC C AGC ACAAC CACT ATT C GGGT CT GGAATGAAACT GT CT CCAACCT GACC CTT ATGGCC 1038 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I 
Db 1039 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 1098 
Qy 481 

AT T GCT GGT GAT CT GGGAC CTT CTACCATT GT AGGGAGT GCAGCCTT CAACAT GT TCATC 540 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1099 

ATT GCT GGT GAT CT GGGAC CTT CTAC CAT TGT AGGGAGT GCAGCCT T CAACAT GT T CATC 1158 
Qy 541 

AT CATT GGCAT CT GTGT CT ACGT GAT CC CAGACGGAGAGACT CGCAAGATCAAGCAT CTA 600 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1159 

AT CATT GGCAT CT GTGT CT ACGT GAT CCCAGACGGAGAGACT CGCAAGATCAAGCAT CTA 1218 
Qy 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 1219 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 1278 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1279 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 1338 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1339 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 1398 
Qy 781 

AT GCACAAAAAGT ACC GCAC AGACAAACACC GAGGAATT ATCAT AGAGAC AGAGGGT GAC 840 

I I II I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1399 

AT GCACAAAAAGT ACC GCAC AGACAAACAC C GAGGAAT T ATCAT AGAGACAGAGGGT GAC 1458 
Qy 841 

CACCCTAAGGGCATTGAGATGGATGGGAAAAT GAT GAATTCCCATTTTCTAGATGGGAAC 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1459 

CACCCTAAGGGCATTGAGATGGATGGGAAAAT GATGAATT CCCATTTTCTAGAT GGGAAC 1518 
Qy 901 

CT GGT GC CCCT GGAAGGGAAGGAAGT GGAT GAGT CCCGC AGAGAGATGAT CC GGATT CT C 960 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1519 

CT GGT GC CCCT GGAAGGGAAGGAAGTGGAT GAGT C CC GCAGAGAGAT GAT CCGGATT CT C 1578 

Qy 961 AAGGAT CT GAA 971 

I I I I I I I I I I I 
Db 1579 AAGGAT CT GAA 1589 



RESULT 12 
AK048160 

LOCUS AK048160 3573 bp mRNA linear HTC 03- 

APR-2004 

DEFINITION Mus musculus 16 days embryo head cDNA, RIKEN full-length 
enriched 

library, clone : C130038C08 product : solute carrier family 8 
(sodium/ calcium exchanger), member 1, full insert sequence. 
ACCESSION AK048160 

VERSION AK048160.1 GI:26339181 

KEYWORDS HTC; CAP trapper. 
SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; 

Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; 
Murinae; Mus. 
REFERENCE 1 

AUTHORS Carninci,P. and Hayashizaki, Y. 
TITLE High-efficiency full-length cDNA cloning 

JOURNAL Meth. Enzymol . 303, 19-44 (1999) 
MEDLINE 99279253 
PUBMED 10349636 
REFERENCE 2 

AUTHORS Carninci,P., Shibata, Y., Hayatsu,N., Sugahara,Y., 
Shibata, K. , 

Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and 
Hayashizaki, Y. 

TITLE Normalization and subtraction of cap-trapper-selected cDNAs 

to 

prepare full-length cDNA libraries for rapid discovery of 

new genes 

JOURNAL Genome Res. 10 (10), 1617-1630 (2000) 
MEDLINE 20499374 
PUBMED 11042159 
REFERENCE 3 

AUTHORS Shibata, K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki, N., 
Carninci, P. , 

Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., 

Itoh,M., 

Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., 

Harada, A. , 

Yamamoto,R., Matsumoto, H. , Sakaguchi, S . , Ikegami,T., 
Kashiwagi, K. , 

Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., 

Watahiki,M. , 

Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., 

Kawai, J. , 

Okazaki,Y., Muramatsu,M. , Inoue,Y., Kira,A. and 
Hayashizaki, Y. 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

and the 

TITLE 
JOURNAL 

REFERENCE 
AUTHORS 

Research 

TITLE 
annotation 

JOURNAL 
REFERENCE 

AUTHORS 
Carninci, P. , 



RIKEN integrated sequence analysis (RISA) system — 384-format 

sequencing pipeline with 384 multicapillary sequencer 

Genome Res. 10 (11), 1757-1771 (2000) 

20530913 

11076861 

4 

The RIKEN Genome Exploration Research Group Phase II Team 
FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA collection 

Nature 409, 685-690 (2001) 

5 

The FANTOM Consortium and the RIKEN Genome Exploration 
Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 

of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
6 (bases 1 to 3573) 

Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., 

Fukuda,S., Furuno,M., Hanagaki,T., Hara,A. , Hashizume, W. , 
Hayashida, K. , Hayatsu,N., Hiramoto,K., Hiraoka,T., 

Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., 

Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M., 
Koya,S., Kurihara,C, Matsuyama, T . , Miyazaki,A., Murata,M., 
Nakamura,M., Nishi,K., Nomura, K., Numazaki,R., Ohno,M., 

Okazaki,Y., Saito,R., Saitoh, H., Sakai,C, Sakai,K., 

Sano,H., Sasaki, D., Shibata,K., Shinagawa, A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A., Takahashi, F. , Takaku- 

Takeda,Y., Tanaka,T., Tomaru,A. , Toya,T., Yasunishi,A. , 
Muramatsu,M. and Hayashizaki, Y. 
Direct Submission 

Submitted ( 16- JUL-2001 ) Yoshihide Hayashizaki, The Institute 

Physical and Chemical Research (RIKEN), Laboratory for 

Exploration Research Group, RIKEN Genomic Sciences Center 

RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, 

Kanagawa 230-0045, Japan (E-mail : genome- res @gsc. riken . jp, 
URL :http: //genome. gsc. riken. jp/, Tel : 81-45-503-9222, 
Fax:81-45-503-9216) 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in 

Genomic Sciences Center and Genome Science Laboratory in 



Hirozane,T. , 

3 

Kasukawa, T . , 

Ohsato,N. , 
Sakazume,N. , 

Akahira, S . , 

TITLE 
JOURNAL 

of 

Genome 
(GSC) , 
Yokohama, 

COMMENT 
Riken 
RIKEN. 

Division of Experimental Animal Research in Riken 
contributed to 

prepare mouse tissues. 

Please visit our web site for further details. 
URL :http: //genome .gsc. riken. jp/ 



URL : http : / / f antom. gsc . riken . j p/ . 
FEATURES Location/Qualifiers 
source 1 . . 3573 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/ db_xr e f = " FANTOM_DB : CI 3 0 0 3 8 CO 8 " 

/db_xref="taxon: 10090" 

/clone="C130038C08" 

/tissue_type="head" 

/clone_lib="RIKEN full-length enriched mouse cDNA 

library" 

/dev_stage="16 days embryo" 
CDS <1. .2122 

/note="unnamed protein product; putative 

solute carrier family 8 ( sodium/ calcium exchanger) , 

member 

1 (MGD | MGI: 107956, GB | NM_011406, evidence: BLASTN, 

99%, 

match=1583) " 
/ codon_start=2 
/protein_id="BAC33262 . 1" 
/db_xref="GI: 26339182" 

/translation="EITIKKPNGETTKTTVRIWNETVSNLTLMALGSSAPEILLSVIE 

VCGHNFTAGDLGPSTIVGSAAFNMFIIIALCVYWPDGETRKIKHLRVFFVTAAWSIF 

AYTWLYIILSVSSPGVVEWEGLLTFFFFPICWFAWVADRRLLFYKYVYKRYRAGKQ 

RGMIIEHEGDRPASKTEIEMDGKVWSHVT)NFLDGALvXEVT)ERDQDDEEARRa 

LKELKQKHPEKEIEQLIELANYQVXSQQQKSRAFYRIQATRLMTGAGNILKRHAADQA 

RKAVSMH EVNMEMAEN DP VS K I FFEQ GT YQ C L EN C GTVALT IMRRGGDL S T T VFVD FR 

TEDGTANAGSDYEFTEGTVIFKPGETQKEIRVGIIDDDIFEEDENFLVHLSNVRVSSD 

VSEDGILESNHASSIACLGSSSTATITIFDDDHAGIFTFEEPWHVSESIGIMEVKVL 

RTSGARGNVIIPYKTIEGTARGGGEDFEDTCGELEFQNDEIVKTISVKVIDDEEYEKN 

KTFFIEIGEPRLVEMSEKKGGFTLTEEYDDKQPLTSKEEEERRIAEMGRPILGEHTKL 

EVIIEESYEFKSTVDKLIKKTNLALWGTNSWREQFIEAITVSAGEDDDDDECGEEKL 

PSCFDYVMHFLTVFWKVliFAFVPPTEYWNGWACFIVSimiGLLTAFIGDLASHFGCT 

IGLKDSVTAWFVALGTSVPGPTL" 

ORIGIN 

Query Match 34.3%; Score 948.4; DB 3; Length 3573; 

Best Local Similarity 67.8%; Pred. No. 1.9e-251; 
Matches 1434; Conservative 0; Mismatches 616; Indels 66; 
Gaps 5; 

Qy 331 

GAGGT GACAATTAAGAAACCCAATGGAGAAACCAGCACAACCACTATTCGGGTCT GGAAT 390 
II I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

I I I I I II 

Db 2 



GAAATAAC GATAAAGAAACCGAAT GGAGAGAC CAC CAAGACGAC GGT GAGAAT CT GGAAC 6 1 



Qy 391 

GAAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCCTCTGCTCCTGAGATACTCCTC 450 

II I I I I I II I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

I I I I I 

Db 62 

GAGACT GT GT C GAAC CT GAC CTT GAT GGCC CT GGGAT CTT CT GCT CCT GAGATT CT C CTG 121 
Qy 451 

TCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGATCTGGGACCTTCTACCATT 510 
II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 122 

TCAGTCATTGAAGTGTGCGGCCATAACTTCACCGCAGGGGACCTGGGTCCCAGCACCATC 181 
Qy 511 

GT AGGGAGT GCAGC CTT CAACAT GT TCATCAT C ATTGGCAT CT GT GT CTACGTGAT C C CA 570 

II II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I 

I I I I 

Db 182 

GT GGGAAGT GCT GC CTT TAAC AT GTT CATCATAAT CGCACT CT GT GT TTACGT GGT C CCT 241 

Qy 571 

GACGGAGAGACT CGCAAGAT CAAGC AT CT ACGAGT CTT CTT CAT CAC C GCT GCT T GGAGT 630 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 242 

GATGGAGAGACAAGGAAGAT CAAGCAT CTGCGT GT GTT CTT T GT GAC AGCAGCCT GGAGC 301 
Qy 631 

ATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGGTCCAG 690 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 302 

ATCTTTGCCTATACCTGGCTTTATATAATCTTGTCTGTCAGCTCTCCTGGAGTTGTGGAG 361 
Qy 691 

GTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCTGGGTG 750 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 362 

GTCTGGGAAGGCTTGCTTACTTTCTTCTTCTTTCCCATCTGCGTTGTGTTCGCGTGGGTA 421 
Qy 751 

GCAGATAAACGACT GCT CT T CT ACAAATACAT GCACAAAAAGT ACC GC ACAGACAAACAC 810 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 422 

GCAGACAGGCGGCTT CT CTTTTACAAGTAT GT CTACAAGCGGT ACAGGGCCGGCAAGCAG 481 

Qy 811 C GAGGAAT TAT CAT AGAGAC AGAGGGT GAC CAC C C 

TAAGGGC ATT GAGAT G 861 

I I I II I I I I I I I I I I I I II II I I I I I I 

I I I 

Db 482 

AGGGGGAT GAT C ATT GAACAT GAAGGAGAC AGAC CAGCTT C CAAAACT GAAAT CGAAAT G 541 
Qy 8 62 

GAT GGGAAAAT GAT GAATT C CC AT T T T CT AGAT GGGAACCT GGT GC CCCT GG 913 



1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 II I I I 1 1 1 1 

Db 542 

GAT GGGAAAGT GGTCAACT CT CAT GTT GACAATTT CTT AGATGGGGCT CT GGTTTT GGAA 601 
Qy 914 

AAGGGAAGGAAGTGGATGAGT CCCGCAGAGAGAT GATCCGGATTCTCAAG 963 

I I I I I I I I I II 1 I I II I I I I I I I I I I 

I I I 

Db 602 

GTT GAT GAGAGGGACCAAGAT GAT GAGGAAGCC AGGCGT GAGAT GGCAAGGATTCT GAAG 661 
Qy 964 

GAT CT GAAGCAAAAAC AC C C AGAGAAGGACTT AGAT CAGCT GGT GGAGAT GGC CAAT TAC 1023 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I 

I I I 

Db 662 

GAACTTAAGCAGAAGC AT C CT GAGAAAGAAATT GAGCAAT TAAT AGAATT AGC CAACT AC 721 
Qy 1024 

TAT GCT CTT T CC CACCAACAGAAGAGCCGC GC CT T CT ACCGTAT CCAAGCCACTC GT AT G 1083 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 722 

CAGGTCCTAAGTCAACAGCAGAAAAGCCGAGCATTTTACAGGATTCAAGCTACTCGCCTG 781 
Qy 1084 

ATGACT GGTGCAGGCAATATCCTGAAGAAACAT GCAGCAGAACAAGCCAAGAAGGCCT CC 1143 

I I I I I II II I I I I I III I I I I I I II I I I I I II I I I I I I I I I I I I 

I 

Db 782 

AT GAC C GGAGCT GGCAACAT CTTGAAGAGGCACGCAGCTGATCAAGCAAGGAAGGCT GT C 841 

Qy 1144 AGCAT GAGCGAGGTGCACACCGAT GAGCCT G 

AGGACTTTATTTCCAAGGTCTTCTTT 1200 

I I I I I I I II III II I I I I I I I I I III 

I I I I I I I I 

Db 842 

AGT AT GCAT GAAGT CAACAT GGAAAT GGCT GAAAAC GAC C CAGT CAGTAAGAT CT T CTT T 901 
Qy 1201 

GACCCAT GTT CTTACCAGTGCCT GGAGAACT GTGGGGCTGTACTCCT GACAGT GGTGAGG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 902 

GAGCAAGGAACAT ACCAGT GT CT AGAGAACT GT GGT ACTGT GGCCCT CAC CATT AT GC GC 961 
Qy 1261 

AAAGGGGGAGACATGTCAAAGACCAT GTATGT GGACTACAAAACAGAGGATGGTTCTGCC 1320 



Db 962 

AGAGGGGGCGACTTGAGCACCACT GTGTTT GTTGACTTCAGGACAGAAGACGGCACAGCC 1021 
Qy 1321 

AAT GC AGGGGCT GACT AT GAGTT CACAGAGGGCACGGT GGTT CT GAAGC C AGGAGAGAC C 1380 

I I I I I III MM II I II II I II II II II III I I II I II II 

I II M I 

Db 1022 

AAT GCTGGGT CTGATT AT GAATTCACGGAAGGGACT GT GATCTTCAAACCAGGGGAGACC 1081 



Qy 1381 

CAGAAGGAGT TCT CCGT GGGC ATAATT GAT GACGACAT TT T TGAGGAGGATGAAC ACTT C 1440 

I I 1 I II I I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1082 

CAGAAGGAAAT CAGAGT T GGC AT CAT T GAT GAT GAT AT CT TT GAAGAAGAT GAAAACTT C 1141 



Qy 1441 

TTT GTAAGGT TGAGCAAT GTC C GC AT AGAGGAGGAGCAGCCAGAGGAGGGGAT GCCT C CA 1500 

I I I I I I I I II I I I I I I II I I I I I I I I I I 

I 

Db 1142 

CTT GT GCAT CTT AGCAAT GT CAGAGT CT CTT C AGATGTTT CAGAAGAT GGCAT ACT AGAA 1201 



Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 
I II II II I III I I I II I I 

I I I I I 

Db 1202 TC 

CAAT CACGCTTCTT CAATT G CT T GT CTT GGGT CAT CCAGCACT GCCACC 1252 



Qy 1561 

GTT ACCAT CT TGGAT GAT GAC C AT GC AGGCAT CTT CACTTTT GAAT GT GAT ACT ATT CAT 1620 

I I I I I I II I I I I I I I I I I I I I I I I II I I I I I II I I I I I I 

I I I 

Db 1253 

AT AACCATT T TT GAT GAT GAC CAT GC AGGCAT CTTT ACAT T T GAGGAACC C GT GACT CAC 1312 



Qy 1621 

GT C AGT GAGAGT ATT GGT GTT AT GGAGGT CAAGGTT CT GC GGACAT CAGGT GCC CGGGGT 1680 

II II I I I I I I I I I I I I I II I I I I I I I I I I II I II II II II II 

I I 

Db 1313 

GT GAGC GAGAGCATT GGCAT CAT GGAGGT GAAGGT TT T GAGAAC CT CT GGAGCT C GAGGA 1372 



Qy 1681 

ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 1740 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 1373 

AAT GTT ATCATT C CCTACAAAACT AT T GAAGGCAC AGCCCGAGGTGGAGGGGAAGACTTT 1432 
Qy 1741 

GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 1800 

II I II I I I III III I I I I I I I I I I I I I I I I I I II I I I I I III 

I I 

Db 1433 

GAGGAC ACCT GTGGAGAGCT C GAATT C CAGAACGAT GAAAT AGT CAAAACAAT AT CAGT C 14 92 



Qy 1801 

AAAAT AGT AGAT GAGGAGGAAT AC GAAAGGCAAGAGAATTT CTT CATT GC CCT T GGT GAA 1860 

II I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I 

I I 

Db 1493 

AAGGTAATCGAT GAC GAGGAGT AT GAGAAAAACAAGAC CTT CTT CAT T GAGATTGGAGAG 1552 

Qy 1861 C C GAAAT GGAT GGA 

AC GT GGAAT AT C AGAT GT G 1893 

II I I I I I I I I I I I 

Db 1553 



CCC CGT CTGGT GGAGAT GAGT GAGAAGAAAG GT GGCTT CAC ATTAAC AGAGGAAT ATGAT 1612 



Qy 1894 

ACAGACAGGAAGCT GACT AT GGAAGAAGAGGAGGC CAAGAGGAT AGC AGAGAT GGGAAAG 1953 

I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I 

Db 1613 

GACAAGCAGC CACT GACC AGCAAAGAAGAGGAGGAGAGGC GCAT T GC GGAAAT GGGGC GC 1672 
Qy 1954 

CCAGTATTGGGTGAACACCCCAAACTAGAAGTCAT CATTGAAGAGT CCTATGAGTTCAAG 2013 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 1673 

C CCAT CCT AGGC GAGC ACAC CAAGCT GGAGGT GAT CAT C GAAGAGT CT T AC GAATT CAAG 1732 
Qy 2014 

ACT ACGGT GGACAAACT GAT CAAGAAGACAAAC CT GGC CTT GGT T GT GGGGACC CATT CC 2073 

I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

I 

Db 1733 

AGCACT GT GGACAAACT C ATTAAGAAGAC GAACCT GGC C CTT GT GGT GGGGACCAACAGC 1792 
Qy 2074 

TGGAGGGACCAGTT CAT GGAGGCCATCACCGTCAGTGCAGCAGGGGAT GAGGATGAGGAT 2133 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1793 

T GGAGAGAGC AGT TTAT CGAAGCC AT CACT GT CAGCGCT GGGGAAGAT GACGAT GATGAT 1852 
Qy 2134 

GAATCCGGGGAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTC 2193 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 

I I 

Db 1853 

GAATGTGGGGAGGAGAAGCTGCCCTCCTGTTTTGATTACGTGATGCACTTTCTCACAGTG 1912 * 
Qy 2194 

TTCTGGAAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGC 2253 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

I I I I I I I I I I I I 
Db 1913 

TTCTGGAAGGTTCTGTTTGCCTTCGTCCCACCTACAGAATACTGGAATGGCTGGGCCTGC 1972 
Qy 2254 

TT C GC CGT CT CCAT CCT CAT CATT GGCATGCTCAC CGC CAT CAT T GGGGACCTGGC CT CG 2313 

III I I I I I I I I I I I I I I II III I II I I I I I I I I I I I I I I I I I I I II 

I I 

Db 1973 

TT C ATT GT CT C CAT C CT C AT GAT C GGC CTACT GACCGC CT T CAT T GGAGAC CT GGCTT CC 2032 
Qy 2314 

CACTTCGGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTT 2373 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I I II II II 

I I 

Db 2033 

CACTTTGGCTGCACCATTGGTCTGAAAGATTCCGTGACTGCCGTTGTGTTTGTTGCTCTT 2092 

Qy 2374 GGCACCTCTGTCCCAG 2389 

II I I I I I II I I II 

Db 2093 GGAACCTCGGTGCCAG 2108 



linear EST 23- 



RESULT 13 
BX374548 

LOCUS BX374548 938 bp * mRNA 

APR-2004 

DEFINITION BX374548 Homo sapiens NEUROBLASTOMA COT 10-NORMALIZED Homo 
sapiens 

cDNA clone CS0DB006YD18 5-PRIME, mRNA sequence. 
ACCESSION BX374548 

VERSION BX374548.2 GI: 46558208 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 938) 

AUTHORS Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
TITLE Full-length cDNA libraries and normalization 

JOURNAL Unpublished (2001) 
COMMENT On May 8, 2003 this sequence version replaced gi:30438490. 

Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
1st strand cDNA was primed with a Notl-oligo (dT) primer. 



end enriched, double-strand cDNA was digested with Not I and 
into the Not I and EcoR V sites of the pCMVSPORT 6 vector. 



Five prime 
cloned 
Library 

was normalized. Library was constructed by Life 
Technologies, a 

division of Invitrogen. This sequence belongs to sequence 

cluster 

7256. r 

For more information about this cluster, see 

http : //www. genoscope . ens . f r/cdna?s=CSODB006DB09_DB1287_2&c= 

7256. r. 

FEATURES Location/Qualifiers 
source 1. .938 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="CS0DB006YD18" 

/ tissue_type="NEUR0BLAST0MA COT 10-NORMALIZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 10- 



NORMALIZED" 
(dT) 
was 

EcoR V 

normalized. 1 
ORIGIN 



/note="lst strand cDNA was primed with a Notl-oligo 
primer. Five prime end enriched, double-strand cDNA 
digested with Not I and cloned into the Not I and 
sites of the pCMVSPORT 6 vector. Library was 



Query Match 



32.5%; Score 899.2; DB 5; Length 938; 



Best Local Similarity 97.8%; Pred. No. 5.4e-238; 
Matches 918; Conservative 0; Mismatches 20; Indels 1 
Gaps 1 ; 

Qy 687 

CCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCTG 746 

I I I I I I I I 1 I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I 

Db 1 CCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCC- 

GTGTGTGTCCTTCTGGCCTG 59 

Qy 747 

GGTGGCAGATAAACGACT GCT CTT CTACAAATACATGCACAAAAAGTACCGCACAGACAA 806 

I I II I I I I I I I I I I I I I I I I I I II I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 60 

GGTGGCAGATAAACGACT GCT CTT CTACAAATACATGCACAAAAAGTACCGCACAGACAA 119 
Qy 807 

ACAC C GAGGAATT AT CAT AGAGAC AGAGGGT GACCACC CTAAGGGCAT TGAGAT GGAT GG 866 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 120 

AC ACCGAGGAATT AT CAT AGAGACAGAGGGT GACCACCCTAAGGGCAT TGAGAT GGAT GG 179 
Qy 867 

GAAAAT GAT GAATT C CC ATT TT CT AGAT GGGAAC CT GGTGCC CCT GGAAGGGAAGGAAGT 926 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 180 

GAAAAT GAT GAATT CCCAT TT T CT AGAT GGGAAC CTGGT GC CCCT GGAAGGGAAGGAAGT 239 
Qy 927 

GGATGAGTCCCGCAGAGAGATGATCCGGATT CTCAAGGAT CTGAAGCAAAAACACCCAGA 986 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 240 

GGAT GAGT C CC GC AGAGAGAT GAT CCGGATT CT CAAGGAT CT GAAGCAAAAACAC C C AGA 299 
Qy 987 

GAAGGACTTAGATCAGCTGGTGGAGAT GGCCAATTACTATGCTCTTT CCCACCAACAGAA 1046 

II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 300 

GAAGGACTTAGATCAGCTGGT GGAGATGGCCAATTACTAT GCTCTTT CCCACCAACAGAA 359 
Qy 1047 

GAGC C GC GCCT T CT ACC GT AT C CAAGCCACT CGT ATGAT GACT GGT GC AGGCAAT AT CCT 1106 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 360 

GAGCCGCGCCTTCTACCGTATCCAAGCCACTCGTATGATGACTGGTGCAGGCAATATCCT 419 
Qy 1107 

GAAGAAAC AT GC AGC AGAACAAGC CAAGAAGGC CT C CAGCAT GAGCGAGGT GC ACAC C GA 1166 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 420 

GAAGAAACAT GCAGCAGAACAAGCCAAGAAGGC CT CCAGCAT GAGCGAGGT GC AC AC C GA 479 



Qy 1167 

TGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTTGACCCATGTTCTTACCAGTGCCTGGA 1226 



I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 480 

T GAGCCT GAGGACTT T ATT T CCAAGGT CT T CTT T GACC CAT GTT CTTACC AGT GC CT GGA 539 
Qy 1227 

GAACTGTGGGGCT GTACTCCTGACAGTGGT GAGGAAAGGGGGAGACATGTCAAAGACCAT 1286 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 540 

GAACT GTGGGGCT GTACTCCTGACAGTGGT GAGGAAAGGGGGAGACATGTCAAAGACCAT 599 
Qy 1287 

GTAT GT GGACT ACAAAAC AGAGGAT GGTT CT GCCAAT GCAGGGGCT GACT AT GAGTT C AC 134 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 600 

GTAT GT GGACTACAAAACAGAGGAT GGTT CT GC CAAT GCAGGGGCT GACT AT GAGTT CAC 659 
Qy 1347 

AGAGGGCAC GGT GGT T CT GAAGCC AGGAGAGAC CCAGAAGGAGTT CT CC GT GGGCATAAT 1406 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 660 

AGAGGGCACGGT GGTT CT GAAGC C AGGAGAGAC CCAGAAGGAGTT CT CC GT GGGCATAAT 719 
Qy 1407 

T GAT GAC GAC ATTTTT GAGGAGGAT GAAC ACTT CTTT GTAAGGTTGAGCAAT GTCCGCAT 1466 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I 
I I I I I I I I I I I I I I I I I I I I 
Db 720 

T GAT GAC GACAT T TT T GAGGAGGAT GAAC ACTT CTT T GNT AGGT TGAGCAAT GTC C GCAT 779 
Qy 1467 

AGAGGAGGAGC AGCCAGAGGAGGGGAT GCCT CCAGCAAT ATT CAACAGT CTT CCCT TGC C 1526 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I 

Db 780 

AGAGGAGGAGCAGCCAGAGGAGGGGATGCCTNNCAGCATATTCAAACAGTCTTCCTTGCC 839 



Qy 1527 

TCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGATGATGACCATGC 1586 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 840 

TCGGGCTGTCCTAGCCTNCCCTTGNGTGGNCACAGTTACCATCTTGGATGATGACCATGC 899 

Qy 1587 AGGCAT CT T C ACT TTT GAAT GT GAT ACTATT CAT GT CAG 1625 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 900 N AGCATCT T C ACTTTT GAAT GT GAT ACT ATT CATGT CAG 938 



RESULT 14 
AY408695 

LOCUS AY408695 2515 bp DNA linear GSS 

DEC-2003 

DEFINITION Mus musculus HCM3309 gene, VIRTUAL TRANSCRIPT, partial 
sequence, 



genomic survey sequence. 
ACCESSION AY408695 

VERSION AY408695.1 GI: 39764666 

KEYWORDS GSS . 

SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; 

Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; 
Murinae; Mus. 

REFERENCE 1 (bases 1 to 2515) 

AUTHORS Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., 
Kejariwal, A. , 

Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
TITLE Inferring nonneutral evolution from human- chimp -mouse 

orthologous 

gene trios 

JOURNAL Science 302 (5652), 1960-1963 (2003) 
PUBMED 14671302 
REFERENCE 2 (bases 1 to 2515) 

AUTHORS Clark, A. G. , Glanowski, S . , Nielson,R., Thomas, P., 
Kejariwal, A. , 

Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
TITLE Direct Submission 

JOURNAL Submitted ( 16-NOV-2003 ) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence was made by sequencing genomic exons and 



COMMENT 
ordering 

FEATURES 

source 



gene 
ORIGIN 

Query Match 



them based on alignment. 

Location/Qualifiers 
1. .2515 

/organism="Mus musculus" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 10090" 
<1. .>2515 
/locus_tag="HCM3309" 



31.6%; Score 874.8; DB 9; 
Best Local Similarity 63.9%; Pred. No. 4.8e-231; 

Matches 1348; Conservative 0; Mismatches 708; Indels 
Gaps 6; 



Length 2515; 

52; 



Qy 691 

GTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCTGGGTG 750 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I 

I I 

Db 426 

GTGTGGGAGGCACTGCTCACACTGATCTTCTTCCCGGTGTGTGTGGTGTTTGCCTGGATG 485 



Qy 751 

GC AGATAAACGACTGCTCT T CT ACAAAT AC AT GC ACAAAAAGT ACC GCAC AGACAAAC AC 810 

II II II I I I I I I I I I I I I I I I I I III II I I I I I I II I I I I III 

I 

Db 486 

GCGGACAAGC GACTGCT CTTCT ACAAGT AC GT GT ACAAGCGCTACCGCAC CGACCCT C GC 545 



Qy 811 

C GAGGAATT ATC ATAGAGACAGAGGGT GAC CAC C CTAAGGGCAT TGAGAT GGATGGGAAA 870 

I 1 I I I I I I I I I I I I I I I I I I I I I I II III I I I I II I I I II II I 

I 

Db 546 

AGT G GAAT CATC ATC GGGGCAGAGGGAGAC CCAC CCAAGAGCAT CGAGCT GGACGGCACA 605 

Qy 871 AT GAT GAAT T C C CAT T T T C TAG AT G 

GGAACCTGGTGCCCCTGGAAGGG 918 

III I I I I I I I I I I I III 

I 

Db 606 

TTCGTGGGCACTGAGGTCCCTGGCGAGCTGGGCGCATTGGGCACAGGTCCTGCTGAGGCG 665 
Qy 919 

AAGGAAGT GGAT GAGT CC C GCAGAGAGAT GAT C CGGATTCT CAAGGAT CT GAAGCAAAAA 978 

I I I I I I I I III I I II I I I I I I I I I I I I I I I II II I I I 

I I 

Db 666 

CGT GAACTAGAT GCC AGCC GGC GT GAGGTC AT C CAGAT CCTTAAGGACTT GAAGC AGAAG 725 
Qy 979 

CACC CAGAGAAGGACTT AGAT C AGCTGGT GGAGAT GGCCAATT ACT ATGCTCT TT CCCAC 1038 

II I I I II I I I II I I II I I I I I I III II I I I I I I I I I I I I I II 

I I I 

Db 726 

CACCCGGATAAGGACCTGGAGCAGCTGAT GGGCAT CGCCAAGTACTATGCACTGCTGCAC 785 
Qy 1039 

CAACAGAAGAGC CGCGCCTT CT AC CGT AT C CAAGC CACT CGT AT GAT GACT GGTGCAGGC 1098 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I 

I I I 

Db 786 

CAGCAGAAGAGCCGCGCCTTCTACCGCATCCAGGCCACGCGGCTGATGACAGGTGCGGGC 845 
Qy 1099 

AAT AT C CT GAAGAAACAT GCAGCAGAACAAGCCAAGAAGGCCT C CAGCAT GAGCGAGGT G 1158 

I I I I I I I I I I I I I I I I I I III III II I I 

Db 84 6 

AATGTGCTGCGCAGACATGCTGCGGATGCTGCCCGCAGGCCGGGAGCCACCGATGGTGCC 905 
Qy 1159 

CACAC CGAT GAGCCT GAGGACTTTATTT C CAAGGT CTT CTT T GACC C AT GTT CTT ACC AG 1218 

II I I I I II I I I I II II I I I I I I I I I I I I I II 

I I 

Db 906 C C C GAT GAT GAG GAT GAT G GT G C C AG T C G C A 

T CT T CTT T G AGC C CAG C CT CT AT CAC 962 

Qy 1219 

TGCCT GGAGAACT GT GGGGCT GTACTCCTGACAGT GGT GAGGAAAGGGGGAGACATGT CA 1278 

I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II 

Db 963 

TGCCTGGAGAACTGCGGGTCAGTGCTGCTGTCCGTGGCTTGCCAGGGTGGTGAGGGCAAC 1022 
Qy 1279 

AAGAC CAT GT AT GT GGACT ACAAAAC AGAGGAT GGTT CT GCCAAT GC AGGGGCTGACT AT 1338 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

I I I 

Db 1023 



AGCACCTTCTACGTGGACTACCGTACTGAGGACGGTTCTGCAAAGGCAGGCTCCGATTAT 1082 
Qy 1339 

GAGTT CAC AGAGGGCACGGTGGTT CT GAAGC C AGGAGAGAC C C AGAAGGAGTT CT CCGT G 1398 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1083 

GAGT ACAGCGAGGGCACACTGGT GT T CAAGCC C GGGGAGAC GC AGAAGGACCT GC GCAT C 1142 
Qy 1399 

GGCATAATT GAT GACGACATTTTTGAGGAGGATGAACACTTCTTTGTAAGGTT GAGCAAT 1458 

II II II II I I I I I I I I II I II II I I I II I I I I I I I I II III II 

I I 

Db 1143 

GGTATCATCGACGACGACATCTTCGAGGAGGACGAGCACTTCTT CGT GAGGCT GCTGAAC 1202 
Qy . 1459 

GT C C GCAT AGAGGAGGAGC AGC CAGAGGAGGGGAT GC CT C CAGCAATATT CAACAGT CTT 1518 

I I I I I I I I I I I I II I I I I 

I 

Db 1203 CTGCGTGTGGGCGATGCTCAGGGCATGTTCGAG 

CCCGACGGCGGT 1247 

Qy 1519 

CCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGATGAT 1578 

III II III III I I I I I I I I I I I I I I I I I I I 

I I 

Db 1248 

GGGCGGCCCAAGGGGCGGCTGGTGGCGCCGCTGCTGGCCACTGTCACCATCCTGGACGAC 1307 
Qy 1579 

GAC CAT GC AGGCAT CT T C ACTT TT GAAT GT GAT ACT AT T CAT GT CAGT GAGAGT AT T GGT 1638 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1308 

GACCACGCGGGCATCTTCTCCTTCCAGGACCGCCTGCTGCATGTGAGCGAGTGCATGGGC 1367 
Qy 1639 

GTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGTACAGTCATCGTCCCCTTT 1698 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1368 

ACTGTGGATGTGCGCGTGGTTCGCAGCTCGGGCGCCCGTGGCACTGTACGCCTCCCCTAC 1427 
Qy 1699 

AGGACAGT AGAAGGGACAGCCAAGGGT GGCGGT GAGGACT T T GAAGACAC AT AT GGGGAG 1758 

I I I I I I I I I I I I I I I I I II II II II I I I I I I I I I 

I I I 

Db 1428 

CGCACAGTGGACGGCACGGCCCGTGGCGGTGGTGTACATTACGAGGATGCTTGTGGAGAG 1487 
Qy 1759 

T T GGAATT CAAGAAT GAT GAAACT GT GAAAACC ATAAGGGT T AAAAT AGT AGAT GAGGAG 1818 

I I I I I I I I I I II I I I I II I II I I I I I I I I I II I I I I II 

I I 

Db 1488 CTGGAGTTCGGCGATGATGAGAC- 

CAGAAAACT CT T CAGGT CAAGAT AGT GGAT GAT GAA 1546 



Qy 1819 

GAATACGAAAGGCAAGAGAATTTCTTCATT GCCCTTGGTGAACCGAAAT GGAT GGAACGT 1878 

I I I I I I I I I I I I I I I I I I I I I I I MM III Mill I 



1 1 

Db 1547 

GAGT AT GAGAAGAAGGACAACTT CTT C ATT GAGCT GGGCCAGC CCC AGT GGCTTAAGCGA 1606 

Qy 1879 GGAAT AT C AGAT GT GAC 

AGACAGGAAGCTGACTATGGAAGAA 1920 

I I I I I I I I I I I I I I II I I I I I II 

I 

Db 1607 

GGC AT CT CAGNNNNNNNNNNNNNNNNNNGGAAT GGAGACAAGAAGATAACT GCAGAGC AG 1666 
Qy 1921 

GAGGAGGCCAAGAGGAT AGCAGAGAT GGGAAAGCCAGT ATT GGGT GAACAC CCCAAACTA 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II 

I I 

Db 1667 

GAGGAGGCCCAGAGGAT AGCAGAGAT GGGCAAGCCAGT T CT T GGGGAGAACAAT CGCCTC 1726 
Qy 1981 

GAAGT CAT C ATT GAAGAGT C CT AT GAGTT CAAGACT AC GGT GGACAAACT GAT CAAGAAG 2040 

II I I I I I I I I II I I I I I I I I I I II III 

Db 1727 

GAGGTCATCATCGAGGAGTCTTATGACTTTAAGNNNNNNNNNNNNNNNNNNNNNNNNNNN 1786 
Qy 2041 

ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 2100 
Db 1787 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 1846 
Qy 2101 AC CGT CAGT GCA 

GCAGGGGATGAGGAT GAGGAT GAAT C CGGGGAGGAGAGGCT GC C C 2157 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I 

Db 1847 

NNNNNNNNNNNNNGGGACGAGGAGGAGGAT GAGGATGGACCT CGT GAGGAGCGGCTGCCG 1906 
Qy 2158 

TCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGT 2217 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I II I I I I I II II II 

I I I I I 

Db 1907 

TCCTGCTTTGACTACGTGATGCACTTCCTGACGGTATTCTGGAAAGTTCTATTCGCCTGC 1966 
Qy 2218 

GTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATT 2277 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1967 

GTTCCACCCACGGAGTACTGCAATGGCTGGGCCTGCTTTGGTGTCTGCATCCTGGTCATT 2026 
Qy 2278 

GGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTC 2337 

II I I I I I I I III I I I I II I I I I I I I I I II II I I I II I I I I I I I II 

I I I 

Db 2027 

GGTGTGCTCACTGCCCTCATCGGAGACCTGGCCTCACACTTTGGGTGCACCGTGGGCCTC 2086 



Qy 2338 

AAAGAT T CAGT CACAGCT GT T GT TT T C GT GGC ATTT GGCAC CT CT GTCC CAGAT ACGTTT 2397 



I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 

Db 2087 

AAGGACTCAGTCAACGCCGTGGTCTTCGTAGCCCTGGGCACCTCCATCCCTGACACGTTC 2146 
Qy 2398 

GC CAGCAAAGCT GCT GCC CT C CAGGAT GT AT AT GCAGACGCCT CCATTGGCAACGT GAC G 2457 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I ! I I I 

I I I I I I I I 

Db 2147 

GCCAGCAAGGTGGCCGCGCTGCAGGACCAGTGTGCCGACGCGTCCATCGGTAACGTGACC 2206 
Qy 2458 

GGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATC 2517 

III I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2207 

GGCTCCAATGCGGTGAACGTGTTCCTGGGCCTGGGTGTGGCCTGGTCGGTGGCAGCGGTG 2266 
Qy 2518 

TACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTC 2577 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2267 

TACTGGGCGGTGCAGGGTCGCCCCTTCGAGGTGCGTGCAGGCACGCTGGCCTTCTCGGTC 2326 
Qy 2578 

ACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCG 2637 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

I I I I I I 

Db 2327 

ACGCTGTTCACCGTCTTCGCCTTCGTGTGCATCGCAGTGCTGTTGTACCGGCGCCGGCCG 2386 
Qy 2638 

CACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTT 2697 

II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

I I I I I 

Db 2387 

CAGATTGGCGGCGAGCTGGGCGGCCCGCGGGGACCCAAGCTGGCCACCACCGCTCTCTTC 2446 
Qy 2698 

GT GAGC CT GT GGCTC CT CT ACAT ACT CTT T GCCACACT AGAGGCCT ATT GCT ACAT CAAG 2757 

I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I 

I 

Db 2447 

CTGGGCCTCTGGTTCCTCTACATTCTCTTCTCCAGCCTGGAGGCTTACTGCCACATCCGG 2506 

Qy 2758 GGGTTCTA 2765 

II I I I II 
Db 2507 GGCTTCTA 2514 



RESULT 15 
BI913344 

LOCUS BI913344 887 bp mRNA linear EST 16 

OCT-2001 

DEFINITION 603178823F1 NIH_MGC_121 Homo sapiens cDNA clone 
IMAGE: 5243308 5 f , 

mRNA sequence. 
ACCESSION BI913344 



VERSION BI913344.1 GI : 16177710 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 887) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection 



REFERENCE 
AUTHORS 
TITLE 

(MGC) 

JOURNAL 

COMMENT 



be 



FEATURES 

source 



NotI; 
pool of 3 
weeks , 
and 

range 

Gruber 
017. Note: 
ORIGIN 



Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r @mail . nih . gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can 

found through the I.M.A.G.E. Consortium/LLNL at: 

http : // image . llnl . gov 

Plate: LLAM11613 row: m column: 05 

High quality sequence stop: 782. 

Location/Qualifiers 

1. .887 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/ clone= " IMAGE :5243308" 

/lab_host="DH10B" 

/ cl one_l ib= "N I H _MGC_1 2 1 " 

/note="Organ: brain; Vector: pCMV-SPORT6; Site_l: 

Site_2: EcoRV (destroyed); RNA source anonymous 

fetal brains, female age 20 weeks, female age 24 

and male age 26 weeks. Library is oligo-dT primed 

directionally cloned (EcoRV site is destroyed upon 
cloning). Average insert size 1.7 kb, insert size 

0.7-3.5 kb. Library is normalized and enriched for 
full-length clones and was constructed by C. 

(Invitrogen) . Research Genetics tracking code 

this is a NIH_MGC Library." 



Query Match 29.4%; Score 813.8; DB 4 

Best Local Similarity 97.2%; Pred. No. 2.9e-214 

Matches 860; Conservative 0; Mismatches 22 
Gaps 3 ; 



Length 887; 
Indels 3; 



Qy 916 

GGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATT CT CAAGGATCTGAAGCAA 975 



I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 



Db 1 

GGGAAGGAAGT GGAT GAGTCCC GCAGAGAGAT GAT CCGGATTCT CAAGGATCT GAAGCAA 60 
Qy 976 

AAACACCC AGAGAAGGACTT AGAT CAGCT GGTGGAGAT GGC CAATT ACTAT GCT CTTT CC 1035 

I I I II I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 

AAACACC C AGAGAAGGACT T AGAT CAGCTGGT GGAGAT GGC CAATT ACTAT GCT CTTT CC 120 
Qy 1036 

CAC CAAC AGAAGAGCCGCGC CTT CT ACC GT AT CCAAGCCACT C GT AT GAT GACT GGT GCA 1095 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 

CAC CAAC AGAAGAGCCGCGC CTT CT ACCGT AT CCAAGC CACT C GT AT GAT GACT GGT GCA 180 
Qy 1096 

GGCAATAT CCTGAAGAAACAT GCAGCAGAACAAGCCAAGAAGGCCTCCAGCATGAGCGAG 1155 

I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 181 

GGCAATAT C CT GAAGAAACAT GCAGCAGAACAAGC CAAGAAGGC CT CCAGCAT GAGC GAG 240 
Qy 1156 

GT GC ACAC C GAT GAGCCT GAGGACTT T ATT T C CAAGGT CTT CT T T GAC CCATGTT CT T AC 1215 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 

GT GCACACCGAT GAGCCT GAGGACTTTATTT CCAAGGTCTT CTTTGACCCATGTTCTTAC 300 
Qy 1216 

C AGT GCCT GGAGAACT GT GGGGCT GTACTCCT GAC AGT GGT GAGGAAAGGGGGAGACAT G 1275 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 

CAGTGCCT GGAGAACTGT GGGGCTGTACTCCTGACAGTGGT GAGGAAAGGGGGAGACAT G 360 
Qy 1276 

TCAAAGACCATGTAT GT GGACTACAAAACAGAGGATGGTTCTGCCAATGCAGGGGCTGAC 1335 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I 
Db 361 

TCAAAGACCATGTATGT GGACTACAAAACAGAGGATGGTT CTGCCAATGCAGGGGCTGAC 420 
Qy 1336 

TATGAGTT CACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACCCAGAAGGAGTT CTCC 1395 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 

TAT GAGTTCACAGAGGGCACGGTGGTT CTGAAGCCAGGAGAGACCCAGAAGGAGTT CTCC 480 
Qy 1396 

GT GGGCATAAT T GAT GAC GACATTT TT GAGGAGGAT GAACACTT CTTT GTAAGGTT GAGC 1455 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 

GT GGGCATAATT GAT GACGAC ATTTTT GAGGAGGAT GAAC ACT T CTTT GTAAGGTT GAGC 540 
Qy 1456 



AAT GT C C GCAT AGAGGAGGAGCAGCCAGAGGAGGGGAT GCCT CCAGCAAT ATTCAACAGT 1515 



I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 541 

AAT GT C CGC AT AGAGGAGGAGCAGC CAGAGGAGGGGAT GCCT CCAGCAAT ATT CAACAGT 600 
Qy 1516 

CTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGAT 1575 

I I I I II I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 601 

CTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGAT 660 
Qy 1576 

GAT GAC CAT GCAGGCAT CTT CACTTTT GAAT GT GAT ACT ATT CAT GT CAGT GAGAGT ATT 1635 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 661 

GAT GACCATGCAGGCAT CTT CACTTTTGAAT GT GAT ACT ATT CAT GTCAGT GAGAGT A- T 719 
Qy 1636 

GGT GTT AT GGAGGT CAAGGTT CT GC GGACAT C AGGT GC CCGGGGT AC AGTCATC GTC CCC 1695 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 720 

GGTGTAATGGAGGTCAAGGTTCTGCGGACATCATGTGCCCGGGGTACAGTCATCGTCCCC 779 
Qy 1696 TTT- 

AGGACAGT AGAAGGGACAGCCAAGGGT GGC GGT GAGGACT TT GAAGACACAT AT GG 1754 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

I I I I I I 

Db 780 

TTT CAGGAC AGGAGAAGGGACAGC CAAGGCT GC AC GCTAAGGACT T GAAGAC CCATAT GC 839 

Qy 1755 GGAGT T GGAATT CAAGAAT GAT GAAACT GT GAAAACCATAAGGGT 1799 

I I I I I I I I I I I I I II II I I I I I I I I I I I I II I I I III 
Db 840 GGAGTT GGAATT C - C GACT GGT GAAC CT GT GAAC ACCATCACGGT 883 



Search completed: January 22, 2005, 17:29:15 
Job time : 8618.76 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



January 22, 2005, 05:47:49 ; Search time 1295.46 Seconds 

(without alignments) 
11208.289 Million cell updates/sec 

US-10-054-680-1 
2766 

1 atggcgtggttaaggttgca gctacatcaaggggttctaa 2766 

IDENTITYJtfUC 

Gapop 10.0 , Gapext 1.0 



4134886 seqs, 2624710521 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



8269772 



Post-processing : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database : N_Geneseq_2 3Sep04 : * 

1 : geneseqnl980s : * 

2 : geneseqnl990s : * 

3 : geneseqn2000s : * 

4 : geneseqn2001as : * 

5: geneseqn2 001bs : * 

6 : geneseqn2002as : * 

7 : geneseqn2002bs : * 

8 : geneseqn2003as : * 

9 : geneseqn2003bs : * 
10 : geneseqn2003cs : * 
11 : geneseqn2003ds : * 
12 : geneseqn2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Result 



SUMMARIES 



% 

Query 



) . 


Score 


Match Length DB 


ID 


Description 


1 


2766 


100.0 


2766 


6 


ABQ78861 


Abq7 8861 Human ion 


2 


2766 


100.0 


3812 


6 


ABQ78863 


Abq7 8863 Human ion 


3 


2764 .4 


99. 9 


2766 


6 


ABQ78864 


Abq7 8 864 Human ion 


4 


2764 .4 


99.9 


2828 


12 


ADH42302 


Adh423 02 Novel hum 


5 


2764 .4 


99.9 


2966 


6 


ABZ33735 


Abz3 373 5 Human TRI 



6 


2763 


99 


9 


2778 


12 


ADH4 2114 


auh?z jifi i\iovex nuiu 


7 


2763 


99 , 


Q 

■ -7 


2778 


1 9 

X 


ADH4 9106 


/\cin.ft^jUD i\iovex num 


8 


2761 . 2 


99 . 


. 8 


2782 


\? 


AB.NR149R 


A K-n R'lAOR Unman 4->-a 
AJjIlD JftZO rlU. IT lei 11 Lid 


9 


2753 


99 , 


. 5 


2769 




AB07 8865 


AV\/T r 7QQ£rc Unman "i r\n 
rtXJLJ/OODD nUllla.Il lOU 


10 


2751.4 


99 






o 


rtXsv^ / OOou 


ADq/ooDb numan ion 


11 


2733 .4 


98 . 


8 


2 781 


D 


ARA0475fi 


A V"\ artATCC Ui i m an n o 4- 
HDaU^ / Do XlUITlari IiaC 


12 


2673 . 2 


96 . 






o 

O 




ADXODZbJ WUITlcLn JNUV 


13 


2673 . 2 


96 , 


6 


o o _> 


T 9 




aqii^zjuo jMovej. num 


14 


2673 . 2 


96 . 


6 


O O _> 


X j£ 


nL/flt ^ j X ^ 


Aan4 z j i z jm ove i num 


15 


2657 . 6 


9fi 


1 


4b O *± \J 


Q 
O 


AP.YR £9 £9 


ADXbbzoz riuman jmov 


16 


2657 . 6 


96 . 


, i 


2840 


12 


ADH4 9104 


rt.u.ii*_ z jU4 jm ove x num 


17 


2367 . 2 


85 . 


6 


2813 


8 


ARX5fi9fi1 


SKy^^O^I Unman XT/M7 

HJjaDdzdI riuman IMUV 


18 


2367 . 2 


85 . 


6 


2813 


12 


ADH4 9110 


rt.u.iift zjiu in ove x num 


19 


2367 . 2 


85 . 


6 


2813 


12 


ADH4 911 


Hun^z jid JNOvej. num 


20 


1786 . 4 


64 . 


6 


2534 


10 


ACmnd 1 a 

rt.^ \J VJ X *± 


Accuufixfi riuma.n oyu 


21 


1784 . 8 


64 , 


5 


126512 


5 


ARNR 14 9 9 


fiKnP'JAO Q Unman •H-K-a 
rtJjIlOjftZi? ilUlUall Cio 


22 


1784 . 6 


64 . 


5 


1863 




AB07 8B6? 


AV^rrTRR^O Unman -! <-^n 

Hjjy / ooDz xiuiiicin ion 


23 


1326 . 4 


48 . 


0 


2814 


4 


AAH57177 


A^h^7177 Unman Vioa 
rt.dll j loll nu.llla.il Ilea. 


24 


1294 . 4 


46 . 


8 


3037 


i n 

X V 


ADR^99 9 R 


rt.a.jjDi7 z ZD loxicicy- 


25 


1277 


46 . 


2 


42 82 


9 


rt.rt.i_J _j _> _> o / 


A a l c:^RQ r 7 Unman ACC 

Hal j j jo / riumcin ft D 0 


26 


1277 


46 . 


2 


4291 


3 


AAr7570fi 

rtrtv^ / J / O 


113^7^71^^ Unman 

HOL / J / UD XI U. 1 lid 11 UKr 


27 


1227 . 8 


44 . 


4 


4087 




AAD944R0 


A,a.H94ZL^fi Rmr n n <a 
ndUZ *t *t j U OUVxlie 1M^- 


28 


1208 . 8 


43 . 


7 


3250 


12 


AF)P9 1419 

rtXv £r Z> Itlj 


/-VU.p Z 1 ft X S7 Vjcne oLLO 


29 


1208 . 8 


43 . 


7 


5438 


5 


ARV9410 5 


A V~\-t t "~) A ~X f\ Uiiman >^va 

rtjjvzftjUD riu.rria.ri pro 


30 


897 . 4 


32 . 


4 


1187 


-> 


rtrtO U «7 O O 


Adsyuyoo jjjma encoQ 


31 


897 . 4 


32 . 


4 


1187 


Q 
o 


APD05 91 9 
rtv_ j_; u j -/ O -/ 


Acaujyjy JNOvex num 


32 


878.4 


31 . 


8 




1 9 
x ^ 


Ann2 4 ^ r n 

rti-'V^ a -3 J u 


AQqz^Dbu riuman SOI 


33 


821 . 4 


29 . 


7 


823 


1 9 
x ^ 


APUR707 ft 


Acno /u / u riuman gen 


34 


787 . 2 


28 . 


5 


1836 


4 


AAT1 94fi4 

rt_rtX utoi 




35 


787 . 2 


28 . 


5 


1836 


4 


ABA6448 ft 


Z\ha ^ A A Q n Ui i m a n "F rt a 

rtjjciOrtfrou nuiiid.li xoe 


36 


787 . 2 


28 . 


5 


T RIG 

-L O J O 




AAT44 fiR 7 


Aa-I AACC7 DvaKq ttl ^ 


37 


787 . 2 


28 . 


5 


1836 


4 


ARA11 fil 9 

rtXJrt J X O X -/ 


HDaJ lol" riODc tr J- U 


38 


787 . 2 


28 . 


5 


183 6 


4 


AAK1 9 917 


AaKizyj / riuman ora 


39 


787.2 


28. 


5 


1836 


4 


ABS38231 


Abs3 8231 Human liv 


40 


787.2 


28. 


5 


1836 


6 


ABS12734 


Absl2 734 Human gen 


41 


593 .2 


21. 


4 


4546 


4 


ABL09809 


Abl09809 Drosophil 


42 


503 .4 


18. 


2 


505 


12 


ACH73341 


Ach73341 Human gen 


43 


494 


17. 


9 


551 


12 


ACH70908 


Ach7 0908 Human gen 


44 


366 


13. 


2 


366 


12 


ACH84609 


Ach84609 Human gen 


45 


363 


13. 


1 


363 


4 


AAI85824 


Aai85824 Human pol 



ALIGNMENTS 



RESULT 1 
ABQ78861 

ID ABQ78861 standard; cDNA; 2766 BP. 
XX 

AC ABQ78861; 
XX 

DT 09-OCT-2002 (first entry) 
XX 

DE Human ion exchanger protein #1 cDNA. 
XX 

KW Human; ion exchanger protein; NHIEP; nootropic; cytostatic; gene therapy; 
KW antiarthritic; virucide; chemotherapeutic ; cancer; arthritis; antiviral; 



KW gene; ss; chromosome 14. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1. .2766 

FT /*tag= a 

FT /product = "Ion exchanger protein 1" 

XX 

PN WO200259316-A2 . 
XX 

PD 01-AUG-2002. 
XX 

PF 22-JAN-2002; 2 002WO-US00 18 17 . 
XX 

PR 23-JAN-2001; 2001US-0263384P . 
XX 

PA (LEXI-) LEXICON GENETICS INC. 
XX 

PI Friddle CJ, Hilbun E; 
XX 

DR WPI; 2002-599791/64. 

DR P-PSDB; ABB81913 . 
XX 

PT Novel polynucleotides encoding human ion exchanger proteins that are 

PT structurally related to mammalian sodium- calcium exchanger proteins, 

PT useful for drug screening, diagnosis and in gene therapy of biological 

PT disorders. 

XX 

PS Claim 1; Page 36-37; 42pp; English. 
XX 

CC The invention relates to a novel human ion exchanger protein (NHIEP) , 

CC that shares structural similarity with mammalian sodium-calcium exchanger 

CC proteins, and potassium dependent versions of the same. The NHIEP of the 

CC invention has nootropic, cytostatic, antiarthritic , and virucide 

CC activity. The polynucleotide may have a use in gene therapy. NHIEPs can 

CC be targeted by drugs, oligos, antibodies etc., in order to treat disease 

CC or to therapeutically augment the efficacy of chemotherapeutic agents 

CC used in the treatment of cancer, arthritis, or as antiviral agents. The 

CC sequence encodes a NHIEP of the invention 

XX 

SQ Sequence 2766 BP; 655 A; 678 C; 760 G; 673 T; 0 U; 0 Other; 

Query Match 100.0%; Score 2766; DB 6; Length 2766; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2766; Conservative 0; Mismatches 0; Indels 0; Gaps 0 
Qy 1 ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

1 1 III 1 1 1 1 II I I Ml III! Ill I III 1 1 1 1 MM 1 1 1 II II 1 1 II II II III I ! I II 

Db 1 ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

Qy 61 TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 12 0 

lllllllll MM IIIIIMIIIIIMIIMIIII llllll IIIMIIIIII Mill III 

Db 61 TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

Qy 121 ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 180 

1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 



Db 


12 1 


APAPPPPAPAAPAATPAPTPPTPTTPAPPPTPATPPPaPTPPaaPPaPPPTPTPaTPPTP 
nv-ftvjuvj ^nurtfiLrtfi. l OrtO icv-iui J- \—rt.ooo 1 LH 1 V_oottlw 1 uLHriuuHuuu 1 o 1 Ltt 1 k^V^ 1 o 


i fin 
1 0 u 


Qy 


181 


CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 


240 






Ml Ml Ml Mill IIIMMIIIIIIIMIII MM Mill lllrllM 




Db 


181 


p p a AT P tppt A p p rnn a p a a p p pttp p PTTnnnr! a p a a p a tt^ p p a ppp tp aTTPTPTAT 


O A Pi 
Z4 U 


Qy 


241 


TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 


300 






1 1 1 1 1 1 1 1 1 1 i f 1 1 1 1 1 E i 1 1 1 1 1 1 1 1 1 L 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 F 1 1 1 1 1 1 1 1 1 j 1 j 1 1 




Db 


241 


T TTPTPP P P P TP A T A T A P A THTT P PTTPPPptpt P P a TP a TTn p TP a p p p pt t p a tp p p a 




Qy 


301 


TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 


360 






1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 




Db 


301 


TPTATTPtAAPtTPATPAPPTPTPA APAPAPPPAPPTPapa aTTaapa a apppa aTpp ap a a 

X \w X.tt. X X O.rtrt.0 X J. V*..rt.V^V_ X V_ X V^i-ii-4.0M.O.M.OOOttOO X VjM.V^i-Lrt. 1 1 MJ\\jMJ\±\\^\^\^i\t\ 1 OOttotttt 


*5 £ a 


Qy 


361 


ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 


420 






I Ml Ml MM IMMMM Ml MIM MIM III Ml MM 




Db 


361 


APPAPPAPA appapt ATTPPPPTPTPPa aTPa a aPTPTPTPPa apptp apppttatpppp 


/inn 
4zU 


Qy 


421 


CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 i J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


PTPPPTTPPTPTPPTPPTPAPaTaPTPPTPTPTTTa aTTPaPPTPTPTPPTPATPPPTTP 
xooo x i x v_ xov_ lUL 1 uriuri X 1 v^v^ lLlv-i 1 lnnl lottoo iulvjlbblLAlvjuu 1 1L 


/I Q A 

4 BU 


Qy 


481 


ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 E ] 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 i 1 1 1 




Db 


4 81 


ATTPPTPPTPATPTPPPAPPTTPTAPP ATTPTaPPPaPTPPaPPPTTPa a P A TPTTP ATP 

/I A X OOlOrtl \_ X UUUriL ^ 1 1L lul ttOOOttO 1 OLttoL.\^ X X lul 1 L.A1 L. 


C /I A 


Qy 


541 


ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 


600 






IM Ml Ml MM MM IMMMI MM MM III III MM 




Db 


541 


ATPATTPPPATPTPTPTPTappTPaTPPPapapppapapa ptppp a ap atpa appatpta 


c a a 
0 U U 


Qy 


601 


CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


660 






II MIMI 1 MM MM Mill llllll MM II Mill MM 1 Mill MM MUM 




Db 


601 


PP APTPTTPTTPaTPAPPPPTPPTTPP APTATPTTTPPPT A P ATPTPPPTPT A T ATP ATT 
v_o.m.o x v_ llLll LHiLaLLoblbt 1 Hjo.tt.vj 1 Al L. 1 X loLL. lttL.tt.xL. looL. 1 L. lAiAlbAl 1 


c c a 


Qy 


661 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 


720 


Db 




1 II MM II II II II 1 II II III II II 1 II MM II II II II II II II II II II II MM 




661 


PTPPPAPTPTTPTPPPPTPPTPTPPTPPaPPTTTPPPa aPPPPTPPTP APTPTPTTPTTP 
1 uo^rtu itiit 1 OO X 0 1 00 1 ^Lrtuu 111 000AA00L.L. 1 L.L. 1 L.AL. 1L.1L.1 1L.1 1 L. 


/ z U 


Qy 


721 


TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 


780 






Mill MM III Mill llllll MM II llllllllll MM MIIMIMMIMM 




Db 




TTTPPa PTPTPTPTPPTTPTPPPPTPPPTPPPAP ATA A A PP A PTPPTPTTOT A P A A A TA r* 

ill L. L.tto I0I0I0IL.L. 1 1 L. I00L.L. lL70L7lL7L7L.AL7AlttttAL.L7AL. loL. 1 L. 1 1 L. 1AL.AAA1AL. 


■"7 0 r\ 

7 80 


Qy 


781 


ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 


840 






1 M 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II II 1 1 1 llllllll 1 1 II 1 1 | || | 1 | 1 | || || || | Ml 




Db 


7 ft 1 
/ox 


a TPPa P a a a a a pt a rpnp apapapaaapa ppp a pp a a tt a to AT^nnpAOAnA ppptp 7\ r-* 
tt 1 oL.ttL.ttttAAAo 1 AL.L.0L.AL.AL7AL.AAAL.AL.U0A00AAI 1 A 1 L.A 1 AoAoALAoAGooToAC 


84 0 


Qy 


841 


CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 


900 






1 1 1 f 1 1 1 1 1 1 1 r 1 1 1 1 1 f 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


Ort X. 


P a PPPTa aPPPP A TTP AP ATPP A TPPP A AAA TP A TP A A TTPPP A TTTTfiT A A m^nr" A7\n 

^-nbtt ltttt.000L.Al 1 0A0A 1 00A 1 oooAAAA 1 oA 1 oAA 1 1L.L.L.A1 1 1 1 L. 1 Ao A 1 bbbAAC 


900 




QM 


r >r mrx r pnnr , r^r ,r rnn, a a ppp a a pp a a ptpp a tp a n r rnnnnr* a p a p a p a tp a Tnnr^n a ttptp 

0 x OO X uLLLb 1 uurtHouuHHbbHAb 1 oott 1 otto 1 L.L.L.0L.A0A0A0A 1 oA 1 L.L.00A 1 1 L. 1 L. 


O ^ A 

you 






1 II 1 II 1 1 1 1 1 1 1 II III 1 1 1 II II 1 1 1 lllllllll 1 II Ml II II 1 II II II Mill 1 




Db 


901 


CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 


960 


Qy 


961 


AAGGATCTGAAGCaAaAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 


1020 






1 1 1 1 1 1 1 1 1 1 Ml 1 II 1 1 1 M II Ml 1 II II II II 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 M 1 1 1 II 1 




Db 


961 


AAGGATCTGAAGCaaAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 


1020 



Qy 


1021 


TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 


1080 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 El 1 1 1 1 1 1 1 
1 II 1 II 1 1 1 II II II 1 II II II I II 1 1 II 1 1 II II II II II II II 1 1 1 1 1 II II II II II 




Db 


1021 


TAPTATGPTPTTTPPPAPPAAPAGAAGAGPPGPPPPTTPTAPPPTATPPAAGPPAPTPOT 




Qy 


1081 


ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 


1140 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 II II II 1 1 1 II 1 1 1 II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1081 


ATGATGAPTGGTGPAGGPA ATATPPTGAAP A A APATPP AGP AP1\Z\P A APPP A AP A APPPP 

niunlunV, 1UU X \j\^n\j\j\*nn. 1. 1\ X \— v_ X \jr\n\jnr\n\*,n X OV_.rtOV»i^VjiHrtV_.iT-r-.VJV— V-.iHiH.VjiT_H\jVjV»,V_, 


114 0 
1 11 U 


Qy 


1141 


TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 


1200 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 M 1 1 1 II II II II 1 II 1 1 II II 1 1 II II II II II 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 




Db 


1141 


TCCAGPATGAGPGAGGTGPAPAPPGATGAGPPTGAGGAPTTTATTTPPAAPGTPTTPTTT 


i 9 n n 

1 z u u 


Qy 


1201 


GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 


1260 






IIIIMI MM! IIIIIIIMIIII! Mill Mill II M II II 1 MM: il 1 1 1 




Db 


1201 


GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 


1260 


Qy 


1261 


AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 


1320 






1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 l l 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1261 


AAAGGGGGAGAPATGTPAAAGAPPATGTATGTGGAPTAPAAAAPAPAPPATPPTTPTPPP 




Qy 


1321 


AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 
II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I I I I I I I I I I I I 


1380 


Db 


1321 


1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
AATGPAGGGGPTGAPTATGAGTTPAPAGAGGGPAPPGTPGTTPTPAAPPPAPPZlP2APi\PP 


1 1 ft o 


Qy 


1381 


CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 


1440 






1 M I ! I I I I I M I I 1 i I I i i i i i I i i i i i i i i i i i i i i i i i i i I i i i i i I i i i i i i i i i i 
1 II 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 II 1 1 1 1 II 1 1 1 




Db 


1 TQI 
x 3 O X 


papa AGGAnTTPTPPnTnpppziTZiZiTTn2iTPZiPPapaTTTTTPZinnapp2i r rriziapaP r r r rr i 


1 A A C\ 
l*t ft U 


Qy 


1441 


TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 


1500 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 l 1 l l l l l l l l l l l l l l l l i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 II II 1 1 1 II 1 1 1 1 1 II 1 1 1 II II 1 1 1 II II 1 1 1 II 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


144 1 


TTTGTAAGGTTGAGPAATGTPPGPATAGAGGAPPAPPAPPPAP2\PPZiPPPP2\TPPPTPP2\ 

X X X VJ X rtfiUVJ X X Ort.OV_/-Lri. X VJ X V- V- VjV^-rt. X riUrtUvJrtOUrtvj^/iUL V».iH\JrtOVJrtVJVjVJVjiT. 1 VjVw V^ 1 v_.v_.rt. 


i cnn 

1DUU 


Qy 


1501 


GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 


1560 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 
M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1501 


GPAATATTPAAPAGTPTTPPPTTGPPTPGGGPTGTPPTAPPPTPPPPTTPTGTnnPPZiPi\ 

vjv„nriini x v_ nr\\~r\\j lui X v_ v_v_ x x VJV_V_ X LuOOC lulLL X rtu^L 1 X Vj X VJ 1 OvjL.v_.rt.v_.rt 


i ccn 


Qy 


1561 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1620 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I i i i 
1 II II 1 II II 1 1 1 II II 1 1 1 II II 1 1 II II 1 1 II 1 II 1 1 1 II II 1 1 II 1 II II 1 II II II 




Db 


JL JOl 


GTT , APP2\TPT , TPPA ^ ^P2\ , T , PZiPPl\ r ^PPZiPPPZiTPT ^ ^PZ\^ , T ^ ^T , T^ , A A rnfirp,^ a T A OT 1 A ttp a T" 1 
\J X X/-iv_V_M.X V_ X X vjVjrt XVjrt X vjrtv_v_.rt Ivjv-rtVjvjUrtl V_ 1 1 v_rtv_ 111 luAAlljloAlAL ±rt± 1 LHi 




Qy 


1621 


GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 


1680 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l 1 l 1 1 1 1 1 1 I 1 1 1 1 I 1 1 I 1 i 1 i i i i i i i i i i i i i t i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II II 1 M II M II II II II II II II II 1 1 1 1 II II II II 1 




Db 


1 69 1 


VJ 1 v_rtvj 1 vjrtVJrtVj Inl X vjo x VJ x 1H1 vjortvjvj X V_.AH.vjVj 1 IL 1 bLbuALA 1 LAoo 1 Vj V_ V_ v_ VjvjVjvj 1 


loo U 


Qy 


1681 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


1740 






1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l ( l l l l l l l l l l l l l I l I l I I I I I I I I I I I I I I I 
1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


-L D O J. 


ZXPAPTPZVTPPTPPPPTTTAPPZiPAPT AP A A. PHP A P APPP A aPPPTPPPPPTPflPP A r , T"T"T l 
rtLrtUlV^riiLVJlLLL\-.l i 1 rtVj^rtv_.rtO X rtvjrtrtvj7vj^rtv_rtVjv_ v^rtrt.Ovjvj' 1 vjVjv_ vjVjr 1 wrtVjivjrtv^ 1 1 1 


T H A C\ 


Qy 


1741 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 


1800 






1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 




Db 


1741 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 


1800 


Qy 


1801 


AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 






1 1 1 II 1 1 1 1 1 1 ! 1 1 1 ! 1 II 1 1 1 1 II 1 Mill 1 1 1 II 1 1 II 1 1 II MM 1 1 II II 1 II 1 1 




Db 


1801 


AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 



Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 


Qy 


2041 


Db 


2041 


Qy 


2101 


Db 


2101 


Qy 


2161 


Db 


2161 


Qy 


2221 


Db 


2221 


Qy 


2281 


Db 


2281 


Qy 


2341 


Db 


2341 


Qy 


2401 


Db 


2401 


Qy 


2461 


Db 


2461 


Qy 


2521 


Db 


2521 


Qy 


2581 




Zoo 1 


Qy 


2641 


Db 


2641 


Qy 


2701 



CCGAAATGGATGGAACGTGGAATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 192 0 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 II I M 1 1 1 1 

CCGAAATGGATGGAACGTGGAATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 192 0 
GAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTGGGTGAACACCCCAAACTA 1980 

I Ml I II I II II M II III I II MM llllllllll INI I MM I IIIIMIMI Ml 

GAGGAGGCCAAGAGGATAGC AGAGATGGGAAAGC CAGTATTGGGTGAACACC CCAAACTA 1980 
GAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 2 04 0 

1 1 1 1 II II I II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 

GAAGTC ATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 204 0 
ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 2100 

I I Ml II I II 1 1 II 1 1 II II III II I Mill I II 1 1 MM 1 1 II 1 1 MM 1 1 M 1 1 II I 

ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 2100 

ACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCC 2160 

I I I I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
ACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCC 2160 

TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 222 0 

1 1 1 1 1 M I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M J I M 1 1 

TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 222 0 
CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 22 80 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 22 80 
ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 234 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 234 0 
GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 24 00 

I II II II II 1 1 III II II II I II II I MM I II 1 1 II III I II 1 1 1 1| || 1 1 1 M | II I 

GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 2400 
AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 24 60 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 F 1 1 

AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 2460 
AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 252 0 

M II 1 1 1 II II I II 1 1 1 Ml 1 1 II II I III 1 1 II M 1 1 II 1 1 II II II II II I II 1 1 II 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 252 0 
TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2580 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 25 8 0 
CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 264 0 

1 1 1 1 1 1 1 1 1! M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 M 

CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 264 0 
CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 27 00 

I I INI I Ml 1 1 1 1 1 1 1 1 llllll 1 1 1 1 III II I II 1 1 1 1 1 1 III I II 1 1 II II 1 1 Ml I 

CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 2700 
2701 AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2760 



I! Illlllllllllllll Illlll Mill IIIIIIMIIIMIIII INI! II MM III 

Db 2701 AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2760 

Qy 2761 TTCTAA 2766 

Illlll 

Db 2761 TTCTAA 2766 



RESULT 2 
ABQ78863 

ID ABQ78863 standard; cDNA; 3812 BP. 
XX 

AC ABQ78863; 
XX 

DT 09-OCT-2002 (first entry) 
XX 

DE Human ion exchanger protein cDNA #3 . 
XX 

KW Human; ion exchanger protein; NHIEP; nootropic; cytostatic; gene therapy; 

KW antiarthritic; virucide; chemotherapeutic ; cancer; arthritis; antiviral; 

KW gene; ss; chromosome 14. 
XX 

OS Homo sapiens . 
XX 

PN WO200259316-A2. 
XX 

PD 01-AUG-2002. 
XX 

PF 22-JAN-2002; 2002WO-US001817 . 
XX 

PR 23-JAN-2001; 2001US-02633 84P . 
XX 

PA (LEXI-) LEXICON GENETICS INC. 
XX 

PI Friddle CJ, Hilbun E; 
XX 

DR WPI; 2002-599791/64. 
XX 

PT Novel polynucleotides encoding human ion exchanger proteins that are 

PT structurally related to mammalian sodium-calcium exchanger proteins, 

PT useful for drug screening, diagnosis and in gene therapy of biological 

PT disorders. 
XX 

PS Disclosure; Page 41-42; 42pp; English. 
XX 

CC The invention relates to a novel human ion exchanger protein (NHIEP) , 

CC that shares structural similarity with mammalian sodium-calcium exchanger 

CC proteins, and potassium dependent versions of the same. The NHIEP of the 

CC invention has nootropic, cytostatic, antiarthritic, and virucide 

CC activity. The polynucleotide may have a use in gene therapy. NHIEPs can 

CC be targeted by drugs, oligos, antibodies etc., in order to treat disease 

CC or to therapeutically augment the efficacy of chemotherapeutic agents 

CC used in the treatment of cancer, arthritis, or as antiviral agents. The 

CC sequence encodes a NHIEP of the invention, with regions of flanking 

CC sequence 

XX 

SQ Sequence 3812 BP; 860 A; 1059 C; 1041 G; 852 T; 0 U; 0 Other; 



Query Match 100.0%; Score 2766; DB 6; Length 3812; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2766; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 I t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 II M II 1 II II II 1 II II II 1 II II 1 II II II II 1 II II 1 II 1 II II 1 1 II II I 1 Ml 




Db 


618 


atpppptpptta APPTTPPAPPPTPTPAPPTPTPPPTTPPTPPaTTTTPPPPTppTTapp 


O / / 


Qy 


61 


TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l 1 l l l l I l 
1 M 1 M 1 1 1 1 1 1 M M 1 1 II 1 II 1 1 II 1 1 1 II II II 1 II 1 II I I I 1 I I M 1 1 1 1 1 1 II 1 1 




Db 


678 


TTTGTPpTPTTPPTPAATPPTPTTPPAPPAPAPPPTPPTPPPTPAPPPPAPPTPPPA APP 


/ J l 


Qy 


121 


ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 


180 






1 1 1 1 1 1 1 1! 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M II II II II 1 1 1 1 1 II II II II II II II II II 1 1 II II 1 II M II 1 II 1 1 M M II 1 II 




Db 


73 8 


AP APPPPAP A AP A ATP APTPPTPTTP A nnriTP Q TPHP A PTP P a a pp APPPTPTP ATPPTV 


/ y / 


Qy 


181 


C C AATCTGGTAC C CGG AGAAC C CTTC C C TTGGGGAC AAG ATTGC C AGGGTC ATTGTC TAT 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l 1 l l l l l l l l l l I I I I I I I I I I I I i i 

M M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II II II 1 I I I II | | || | || || M | | | | | || | | MM 




Db 


798 


rpAATrTnRTAPPPnnArjAapppTTPPPTTppppapaAPaTTpppapppTPBTTPTPTAT 1 




Qy 


241 


TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l l l l l l I I I I 

II 1 1 1 1 1 1 II II II 1 II 1 1 1 II 1 1 1 1 1 II M 1 1 1 II II 1 II II 1 II 1 1 1 1 1 1 1 II 1 II II 




Db 


858 


TTTPTPPPPPTP AT A T A PATPTTPPTTPPPPTPTPP a TP a TTPPTP a PPP PTTP a TPP P A 


on 
y 1 I 


Qy 


301 


TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 l l l l l l l l l l 

II II 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 M 1 1 II 1 1 II 1 1 II II M M M M 1 1 1 




Db 


918 


TPTATTPA APTP ATPAPPTPTPa apap appp APPTPAPa aTTa apaaapppaaTPPnpaa 


Oil 

y 1 1 


Qy 


361 


ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 


420 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I 

II M 1 1 1 1 1 1 II II II 1 1 1 II 1 1 1 1 II 1 II II 1 1 II 1 1 II 1 II II 1 II II 1 II II 1 1 1 1 1 




Db 


978 


APPAPPAPA APP APTATTPPPPTPTPPA A TP a aarTHTPTPPB A f-«/-irp/-i t\ pppTTATPPPP 




Qy 


421 


CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 


480 






1 1 1 1 1 II 1 II 1 II M MMMM M 1 1 1 1 M M 1 M M MM M MMMMIM MMI 




Db 


1038 


CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 


1097 


Qy 


481 


ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 


540 






1 1 l l l l l l l l l I I I I I I I I I I I i i i i i ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i » i 

! 1 M II II II M ! 1 1 1 1 ! ! II II II II 1 1 II 1 II II II II II 1 II II 1 II 1 II 1 1 1 1 1 1 






± u y o 


AI ItjC IoOIvjAIU ILjUUAUv- 1 ILiALLAl Lvj 1 ALjCjCjAvj 1 LjLACjLL 1 1 CAAC ATGTTCATC 


1157 


Qy 


541 


ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 


600 






I i I I I I I I I I I i I i I i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i > 

1 1 1 M 1 1 I M 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I 1 1 I I 1 1 1 I 1 1 I 1 1 1 1 1 1 | 1 1 | | | || | 






1 1 


A 1 l_A 1 1 talaVwAi U lblbl L. 1 AL.U 1 VaA 1 CLLAbALLjCjACjAuACTCCjCAAGATCAAGCATCTA 


1217 


Qy 


601 


CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


660 


Db 


1218 


1 1 1 1 1 1 1 1 1 1 M 1 M 1 II M 1 1 1 1 1 1 1 1 1 1 II M 1 1 M 1 1 1 1 1 1 M 1 1 M 1 M 1 II M 1 1 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


1277 


Qy 


661 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 


720 






1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 M MM 1 1 M 1 1 1 M 1 M M M 1 M 1 1 1 M 1 1 1 1 




Db 


1278 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 


1337 


Qy 


721 


TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 


780 






1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 M M M M 1 1 1 1 1 II M 1 M 1 M M 1 M M II 1 M M 1 M 1 




Db 


1338 


TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 


1397 



Qy 


781 


ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 


840 






1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 II 




uu 


1 TOO 




14 57 


Qy 


841 


CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 


900 






Mill II Ml IMMMIMIMM MM III Illllllll IMM 




Db 


1 / CO 
J. *± J O 


pzippptb. a ppr!P a ti^p a p a tpp a nppr'P a a a atpbtpa attpppa 'tt'T'ppt rp stppp a ap 


±b 1 / 


Qy 


901 


CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 


960 






II 1 III 1 1 MM 11 INI 1 MM MIMIIIIII llllll MM 1 INI 1 MM 1! MM 




Db 
uu 


1 ci p 
IjlO 


(^^CXCZ r TClc i c*c*(~* r vn.c± a a mcc A appa a ptpp A TP a ptppppp a papapatpa tpppp A TTPTP 
V- 1 Ljo HjLLLLI uoAAbb^aAAbuAAo 1 uoA 1 LjALj ILL Lov_ AoAbAbA 1 LjA i L. L. bbA 1 I L 1 C 


1577 


Qy 


961 


AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 


1020 






1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 | I I | I I | | | | | | | | | | 




nk 
uu 


ID / O 


A APP A TPTP A A P/" 1 A A A A AOA^^T^A^A/^A A fT" 1 A t ill M 1 1 A /"* A «"P/**i 7\ P /"iTTTiTinn 7\ /"*• 7\ mnnnnn 7\ m 

AALjLjAI L- I oAAL7L-AAAAAl.AL-LL.AL7ALjAALjGALTTAGATCAGC 


1637 


Qy 


1021 


TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 


1080 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


J. O J O 


r rz\p r rz\TP»PT , p , T r r r rpppziPPzi z\pft.pzi 2\^z^rir'prir , r , r , nT , T^P r FAr ,, r i r ,r rA r pr i r , A apppa o r p/~'/^ | T' 

J-MA_ J-rtHaL. ILi 1 i.LLLHLLHHLHbAHljAuLLbLoLL 1 1L lALLb lAlLLAAbLLAL J.L.LJ 1 


1697 


Qy 


1081 


ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 


1140 






1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 




Db 
uu 




ATHATHA P TPP TP PAPPPA ATATPPTP AAPAAAPA TPPAPPA A A/^aa/^ooa a/~»A7\ r*r*n/~* 
A 1 Lri\ 1 Li/\L. 1 LjLj 1 LjL.ALjLjL.AA IA1LL 1 LjAALxAAAL A 1 (jL. ALjL. ALjAAL-AALiLl, AAGAAGGCC 


1757 


Qy 


1141 


TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 


1200 






II: III MIMMl: MM III MM III MM Ml MIMIII 




nb 

uu 


1 TCQ 

1 / J(3 


1 UL-ALjUAI LjALtL-LtALtLj lLjL.AL.AL.LbAT(jALjCCTGAGGA 


1817 


Qy 


1201 


GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 


1260 






Ml Mill III III 1 MIMMIMI MUM II IIIIMM Mill IIMMIMM II 




Db 

UU 


X O X o 


dRCCC ATPTTPTTZi PP a PTPPPTPP a p a a r* T T i r %r vr*r , r , r*r* r rr* T r*7\. htpptp a p^rnTinnrno 7v nn 
OM.L.L-L.M.1LJ1 iLl 1 AL»L.ALt1LiLL ILjLjALjAAL ILj ILjLjLjLiL. ib 1 AC 1 L.C 1 CjtACALjTGGTGAGG 


1877 


Qy 


1261 


aAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 


1320 






II III Ml MM MM III MM III MM III MM MM 




Db 


1 fi7fi 
X o / o 


A Z\ ZiP.r2P.nn AP A P A TPTPA A AP APPA TPTATPTPP APT'APA AA 7\ P 7\ P 7\ rri 7\ HP/'i rirnrnnrnrt/^/n 

mah\jLjLjLjL7ALjAL.A 1 Lj 1 L.AAALjAl.L-A 1 Lj I A 1 Lj 1 LjLjAL i ALAAAALAGALjGATGGT TCTGCC 


1937 


Qy 


1321 


AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 


1380 






M MIMI Mill 1 MM MMMIMMIMM llllll MM Mill MM! II Ml 




Db 
uu 


1 QOQ 
X -7 J O 


A A TPP A PPPPPTP A PT A TP APTTPAPAPAPPPPA ppnnipprnrnrirnp 7. 7\p/-in7\riri7\ri7\m\ /^/^i 

ah, 1 LjLALjLjLjLjL. i. L7AL. 1 A l LjALj 1 1 L.AL.AL7AL7LJL7L.AL-GG TGGTTCTGAAGCCAGGAGAGACC 


1997 


Qy 


1381 


CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 


1440 






1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j r 1 f 1 1 1 1 1 1 1 1 f 1 1 1 1 1 




Db 


1998 


CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 


2057 


Qy 


1441 


TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 


1500 






IMIIMIIIIIII IMIIIIIIIIIMIIIMI IIIIIIIIIMIIIIIIMIIIMII 




Db 


Z U O O 


TTTP T A A PPTTP A P P A A TPTPPP PAT^APAPPA PP 7\ 7\ /-1 r~\ /~s t\ /-1 tv /—» /-> 71 /tn/'i/'iTv mrt/tnm/i/'i'n 

1 1 iltIAAljLjI IuALjLAAIGIlXGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 


2117 


Qy 


1501 


GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 


1560 


nb 


z X 1 o 


1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 MM II 1 1 II II II II II 1 1 M 1 II 1 II II 1 1 

bLAAlAi 1 LAALAb 1 L> xTL-CCTl GLLTL.GGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 


2177 


Qy 


1561 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1620 






M 1 MM 1 1 1 1 1 1 1 1 1 1 1 Mill Ml llllll II 1 1 MM II II M 1 1 II 1 III M 1 1 M 




Db 


2178 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


2237 


Qy 


1621 


GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 


1680 



Illlllllllllllllllllllllllllllllllllllllllllllllllllllllll 

CAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 22 97 



Db 


2238 


GTPAGTGAGAGTATTGGTGTTATGGAGGTPAAGGTTPTGPnPAPAT^ 


77 97 


Qy 


1681 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


1740 






1 1 1 It 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

M II II II 1 II II II 1 II 1 II M II II 1 II II 1 II II I II 1 II II 1 1 1 M 1 II 1 MINI 




Db 


2298 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


2357 


Qy 


1741 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 


1800 






I 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II 1 1 1 II II II 1 1 II 1 1 1 II II 1 1 1 II 1 1 II II 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 




Db 


2358 


GAAGACACATATGGGGAGTTGGAATTPAA.GAJVTGATGAAAPTGTGAAAAPPATAAGGGTT 


9417 


Qy 


1801 


AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2418 


AAAAT AGT AG ATG AGG AGG AAT AG G AAAGGP A A G A G A A TTTP TT P A TTG PP PTTGGTP A A 


7477 


Qy 


1861 


CCGAAATGGATGGAACGTGGAATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 


1920 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 
1 1 1 1 II 1 1 1 1 1 1 II II 1 1 1 1 1 1 II 1 1 1 1 1 II 1 II 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 II 




Db 


2478 


CCGAAATGGATGGAACGTGGAATATCAGATGTGAGAGArAGGAAGrTGAGTATGnZiAGAA 


7 ^ ^ 7 


Qy 


1921 


GAGGAGGCC AAG AGGATAGCAGAGATGGGAAAG CC AGTATTGGGTGAACAC C CCAAACTA 


1980 






1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l 
1 1 1 1 II 1 1 II 1 1 1 II II 1 II II 1 1 1 II 1 1 1 II II II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 M 1 I 




Db 


2538 


GAGGAGGCCAAGAGGATAGrAGAGATGGGAAAGrrAGTATTGGGTGAapanrrr2\2\Z\PTa 


Z D y i 


Qy 


1981 


GAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 


2040 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I 1 I I I I I I I I I l l I I I I I I I I I I I i I i I i i 
II 1 II 1 1 1 1 II 1 II 1 1 1 1 II 1 II II 1 1 II II 1 II II 1 II II 1 1 II II 1 II II II II II 




Db 


2598 


GAAGTCATCATTGAAGAGTPPTATGAGTTrAAGAPTAPGGTGr?Z\P2\ AZ\PTPZiTPZiZiPZi2ifi 


z o o / 


Qy 


2041 


ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 


2100 






I 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II II M M 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2658 


ACAAACCTGGCCTTGGTTGTGGGGAPPPATTPPTGGAGGGAPPAGTTPATnnAPPPPaTP 


7 7 17 
Z / X / 


Qy 


2101 


ACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCC 


2160 






II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 II 1 1 1 1 1 M 1 M 1 1 M II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I || I I || || I 




Db 


2718 


ACPGTPAGTGPAGPAGGGGATGAGGATnAGnATGAATPPnnPPZiPniiPannPTnPPPTPP 


Oil 7 
Z / / / 


Qy 


2161 


TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 


2220 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 
1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2778 


TGPTTTGAPTAPGTPATr?PAPTTPPTGZ\PTnTPTTPTPP2\ Z\PPT , PP ^ ^P ^ ^ ,^ P r P^T , ^ ,r PP ^ P^ ,l T , ^ , 


Zoo f 


Qy 


2221 


CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 


2280 






1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 




Db 


2838 


CCCCCCAC AG APT APTPPPAPPPPTPPPPPTPPTTPPPPPTPTPP z\ tpptp A r rr , z\ r r r rr , r'r i 


z o y / 


Qy 


2281 


ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 


2340 






1 1 MM 1 1 1 1 1 1 1 III 1 1 1 IIIIMIII II Mill 1 1 Ml 1 1 1 1 1 1 II 1 1 II II 1 1 1 III 




Db 


2898 


ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 


2957 


Qy 


2341 


GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 


2400 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2958 


GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 


3017 


Qy 


2401 


AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 


2460 


Db 


3018 


1 1 1 II 1 1 1 1 1 1 1 1 M II 1 1 M 1 1 II 1 II 1 1 II II II 1 II II 1 II 1 1 1 1 1 1 M 1 II 1 1 1 M 

AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 


3077 


Qy 


2461 


AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 


2520 



MINI Mill IMIIIIIIIIIIMIIIIIII MM! MINIM MINI MM MM 



Db 



3 078 AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 



3137 



Qy 


2521 


Db 


3138 


Qy 


2581 


Db 


3198 


Qy 


2641 


JJD 


"3 O C O 


Qy 


2701 


Db 


3318 


Qy 


2761 


Db 


3378 



TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2580 

. 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 M 1 1 1 1 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 3197 

CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 264 0 

1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 r 1 1 r r 1 1 r 1 1 1 1 e 1 1 1 1 r 1 1 1 r 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 r 1 1 

CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 3257 

CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 2700 

1 1 MM I INI 1 1 II I Mill I Mill INI I llllllllll I III llllllllll INI 

CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 3317 

AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2760 

M 1 1 M Ml 1 1 M I M MIM IMM III M 1 1 1 M 1 1 1 1 1 Ml M M I M M M I II I 

AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 3377 







RESULT 3 
ABQ78864 

ID ABQ78864 standard; cDNA; 2766 BP. 
XX 

AC ABQ78864; 
XX 

DT 09-OCT-2002 (first entry) 
XX 

DE Human ion exchanger protein #1 cDNA A/G mutant. 
XX 

KW Human; ion exchanger protein; NHIEP; nootropic; cytostatic; gene therapy; 
KW antiarthritic; virucide; chemotherapeutic ; cancer; arthritis; antiviral; 
KW gene; ss; mutant. 
XX 

OS Homo sapiens . 
OS Synthetic. 
XX 

FH Key Location/Qualifiers 
FT mutation replace (1889, A) 

FT /*tag= a 

XX 

PN WO200259316-A2 . 
XX 

PD 01-AUG-2002. 
XX 

PF 22-JAN-2002; 2002WO-US001817 . 
XX 

PR 23-JAN-2001; 2001US-0263384P . 
XX 

PA (LEXI-) LEXICON GENETICS INC. 
XX 

PI Friddle CJ, Hilbun E; 
XX 

DR WPI; 2002-599791/64. 



XX 

PT Novel polynucleotides encoding human ion exchanger proteins that are 

PT structurally related to mammalian sodium-calcium exchanger proteins, 

PT useful for drug screening, diagnosis and in gene therapy of biological 

PT disorders. 
XX 

PS Disclosure; Page; 42pp; English. 
XX 

CC The invention relates to a novel human ion exchanger protein (NHIEP) , 

CC that shares structural similarity with mammalian sodium-calcium exchanger 

CC proteins, and potassium dependent versions of the same. The NHIEP of the 

CC invention has nootropic, cytostatic, antiarthritic , and virucide 

CC activity. The polynucleotide may have a use in gene therapy. NHIEPs can 

CC be targeted by drugs, oligos, antibodies etc., in order to treat disease 

CC or to therapeutically augment the efficacy of chemotherapeutic agents 

CC used in the treatment of cancer, arthritis, or as antiviral agents. The 

CC sequence represents a mutant form of a NHIEP of the invention. Note: The 

CC present sequence is not shown in the specification but is derived from 

CC the human NHIEP sequence shown as SEQ ID 1 (ABQ78861) 
XX 

SQ Sequence 2766 BP; 654 A; 678 C; 761 G; 673 T; 0 U; 0 Other; 

Query Match 99.9%; Score 2764.4; DB 6; Length 2766; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2765; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 6 0 

1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 Ml II II 1 1 1 II I MM MM! I II 1 1 1 II 1 1 1 II II III 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 6 0 
TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 12 0 

1 1 II II I II 1 1 1 M 1 1 II 1 1 1 1 M II 1 1 II II 1 1 1 1 1 M I II I M I II III 1 1 II 1 1 1 II 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 12 0 
ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 180 

1 1 1 1 1 1 1 II II II II II II I IMIMI I II II I II M 1 1 1 Ml II I II M 1 1 1 M 1 1 Ml 

ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 180 
CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 

1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 
TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

1 1 II II III II 1 1 M I MM II I MM IMIMI I II 1 1 1 1 M 1 1 M 1 1 MM M 1 1 Ml 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 3 00 
TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 360 

MMMMMMMMMMMMMMIMMMMMMMMMMMMMMMI 

TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 360 
ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 42 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 42 0 
CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

1 1 1 1 1 1 1 Ml M 1 1 1 II 1 1 1 1 M I II II 1 1 1 1 1 MM 1 1 1 II I M 1 1 M M 1 1 1 M M 1 1 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 4 80 



Qy 


l 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 



Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 



ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 54 0 

1 1 II II MM I II 1 1 Mill MM 1 1 II II 1 1 III III II II III I MM II Mill 1 1 1 

ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 54 0 

ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 600 

IIIIMIIIIIMI IMIIIIIIIIIIIIIIIIIIIIIII IMIIIMIIMIIMIMI 

ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 600 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

II llllllllll II MM MINI MM MINIMUM MM I Mill MINIUM 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 72 0 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 7 80 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 7 80 

ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 84 0 

II II II MM 1 1 II I II II I III I II 1 1 II 1 1 1 II III II II 1 1 1 MM I II III III II 

ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 84 0 

CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 900 

MMMMMMMMMMMM MMMMMMMMMMMMMMIMMMI 

CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 900 

CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 960 

I I I II I II II I I I I II II II II II II I II II II M I M II II II I I M II II I II I II I I 
CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 960 

AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 102 0 

IMMMMMMMMMMMMMMMMMMMMMMMMMMMMMI 

AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 102 0 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

I I Ml 1 1 II 1 1 1 II I II II I Ml 1 1 II I II I II II Ml II I II M MM Ml III 1 1 1 II 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 114 0 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 114 0 

TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 1200 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 12 00 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 12 60 

MMMMMMMMMMMMMMMMMMIMMMMMMMMMMMI 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 

AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 132 0 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 132 0 



Qy 


1321 


AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 


1380 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 




A A ■PPIP 1 A df*C*r i C tr PC % '& /^ ,r P2i , T , /*2Zi/TT r rr , 2iO API A^PP^^ A P , P^PT" , P" , P' ,, T ,r PP' p PP» A A PPPAPP AP» A P"» A f~*(~* 
t\i\ l VjL.H.VjVjVjVjL 1 uHL 1 A 1 bHo 1 1 LALAVjM.VjVjVjLALVjVj i VjVj 1 1 L 1 VjAAVjL LAVjVjAVjAVjALL 


1 O Q A 


Qy 


1381 


CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 


1440 






1 1 . 1 , 1 . 1 1 ll< II:: II' ll II. 




Db 


1 ^ ft 1 

-L -J O _L 


PAPA AnrtB^!TTr , TPr , (^!Tnr , r , r , RTa A TTP A TP" A HP A P* A T lr r r P , T"T l pi A P^P 1 7\rir<7\ rn/-i a A P« A PT"Pn 

L/\VjAAVjVjAVj 1 ILILLVjIVjVjVjLAIAAI 1 VjA 1 VjALVjALA 1 111 1VjAUVjAVjCjA1CjAA.CAC.TTC 


144 0 


Qy 


1441 


TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 


1500 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 






1 A A 1 


rprprpz-jpp j\ J\pprpmp APPAA TPTPPPPATAP APPAPPAPPAPPPAPAPPAPPPPn rpr<r<Ornnm\ 

ill Vj 1 AAVjVj 1 i VjAVjLAA 1 Vj 1 LLVjLAI AvjALGAVjVjAvjLAvjLLAGAGGAGGGGATGCCTCCA 


1500 


Qy 


1501 


GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 


1560 






III MM MM MM MM MM MM MM MM MIMM 




nh 


ism 


\jK-H1\LH.± 1 LMALAVj LLL 1 IVjLL 1 LVjVjVjL 1 Vj 1 LL IAVjLL 1 LLLL 1 1 Vj 1 Vj I VjVjLLALA 


1560 


Qy 


1561 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1620 






MINIMUM MIIIMI IMIIIIIMIIIMMMIMMIMIIMMIMM 




Db 


iJOl 


(TTTZl C i C i T±'Vr ,r T ! T'CinT!L f T > ClA TTZ A P'P 1 AT PIP 1 A PI PIP* A TPTTPS r^^'VI^TT 1 A a TPTP 1 ATA PTATTPAT 
vJ 1 IHLLftiL 1 1 VjVjM.1 VjM.1 Vj/\V_L/\1 VjLAVjVjLAI V- 1 1 CAL 1111 VjAA 1 Vj 1 VjA 1 AL 1 A I 1 LA 1 


162 0 


Qy 


1621 


GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 


1680 






Mill MM Ml MM Mill MM MIMIMIIM Mill Ml 




nh 

UD 


1 (TOl 
1QZ1 


vji v_avj1VjAVjAVj1A1 IVjVjIvjI 1 A 1 vjVjAvjVj 1 C AAvjCj 1 1 LTvjLGGACATCAGGTGCCCGGGGT 


1680 


Qy 


1681 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


1740 






llllll M 1 1 1 1 1 1 1 1 II 1 1 1 III Ml 1 II 1 1 1 1 1 1 II II II II II 1 1 1 1 1 II 1 II 1 1 1 1 




nh 


1 ft 1 


iPflPTPA TPPTPPPPTTT 1 APP7\PA PTA P 1 A APPP7\P , APP l r l 7\ iin/iPTPPPPPTiPH ri/*i7» /-imiimmi 
ALAVj 1 LAiHj 1 LLLL 1 1 1 AVjVjALAVj 1 AVjAALVjVjALAVjLLAAVjVjVj I VjVjLGGTGAGGACTTT 


174 0 


Qy 


1741 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGaAACTGTGAAaACCATAAGGGTT 


1800 






Ml MM MIMI III :lll MMM MM MM IIIMMMM 




nh 


-L / *± J- 


PI A A PI A P 1 A P 1 A TATPPPP APTTPP A A r P r PP 1 A APA A t\ rp/~ i a a a P"PriiTn A 7\ 7\ 7\ or<7\ rpTv 7\ nnminn 

VjAAVjALAL A 1 A 1 VjVjVjVjAVj 1 1 VjVjAA 1 1 L AAvjAA 1 VjA I vjAAALTGTGAAAACCATAAGGGTT 


1800 


Qy 


1801 


AAAATAGTAGATGAGGAGGaATACGaAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 






MIMIMIMIIMMMIIMIIIIMMIMMIMI MMMIMIIMMIMM 




nh 


i q a i 


AAAAl Avj 1 AVjAI VjAvjCjAvjVjAArALGAAAGGCAAGAGAATTTCTTCATTGCCCT 


1860 


Qy 


1861 


CCGaAATGGATGGAACGTGGaATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 


1920 






Ml MM IMM III Ml MM. MM MM MIM Mill. 




Db 


1861 


CCGAAATGGATGGAACGTGGAATATCAGGTGTGACAGACAGGAAGCTGACTATGGAAGAA 


1920 


Qy 


1921 


GAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTGGGTGaACACCCCAaACTA 


1980 






Ml MM MM III III MM MM Mill IMM Ml 






X _? Z X 


P2AP1P1AP!P1P^P , A AP APP ATAPP AP AP A T'P^P'P 1 A A APPPAPTA T"Tir<r"r"Tiri 7V 7V ni\ f~s/~\f~\ /""• 7\ t\ Tirirrnv 

vjavjVjavjVju v_aavjAvjVjA l AvjLAvjAVjA 1 VjVjVjAAAvjLLAvj 1 ATTGGGTGAACAClClAAACTA 


1980 


Qy 


1981 


GAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACaAACTGATCAAGAAG 


2040 


nh 




MM Mill MM MM MIM MM IIIMMMM MM MMM 

vjAAVj 1 v^A I LA 1 1 VjAAVjAvj ILL 1 Al vjAGTTCAAGACTACGGTGGACaaACTGATCAAGAAG 


2040 


Qy 


2041 


ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 


2100 






MIM IMM IMMMII III MMM MM MM. llllll Ml 






9 f)4. 1 


P 1 A. A A P'P 1 T n P!lPlP 1 P , T n T n P2P2 r r r rP ,r PP , P , P , P 1 APPPATTPPTPP APPPA PipiA P"T"T»/^ A rnrip t\ <^/^i/^i/^i7\ rp/-*» 
ALAAALL 1 VjVjLL 1 IVjVjI 1 Vj 1 VjVjVjVjALLLAl 1 LL 1 VjVjAVjVjVjALLAVj 1 1 LATGCjAGGlCATC 


2100 


Qy 


2101 


ACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCC 


2160 






1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 II M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 




Db 


2101 


ACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCC 


2160 


Qy 


2161 


TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 


2220 



: 111 II III : 1 1 1 1 1 [ 1 1 1 1 1 1 : 1 1 1 ! 

TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 2220 



Db 


2161 


x vjv- x x x vj.rt v_ x fi\< VJ x v — x vjvnv. x x \* x vjnv« X vj X V_ X X v_» X VjvJnnVJVj X VJ V— 1VJ1 X X ULU X VJ X vj x Vj 


9990 

__ £t £t \J 


Qy 


2221 


CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 


2280 






1 1 1 1 1 1 1 1 II 1 1 ! 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M II II II 1 II 1 II II II II II 1 1 II II II 1 II II II 1 II 1 II II II 1 II II II II 1 1 1 1 




Db 


2221 


ccccppAPAGAGTACTGPPAPGGPTGGGPPTnpTTPGPPGTPTPPATPPTPATPATTppp 

v- v- v_ v*. v*/"_v~r"_\J_~lVJ X rtv— X vjv_ v_,_- , _v_vjvjv_ x OVJvjV_. v_. X vjv_ x XV_vjv_v_vjXV_x v_. v_,rt l^Wl v_rt X v_..rt X X VJ\JV» 


99 RO 

__ Z O VJ 


Qy 


2281 


ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 


2340 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 LI 1 1 1 1 II 1 1 1 1 




Db 


2281 


ATGPTP APPGPP ATP ATTGGGGAPPTGGPPTPGP APTTPGGPTGP APP ATTPflTPTP A A A 

x vjv_ x vAv< v_ vj v_. V^rt x V_-_-_. X X uuuurt\^ V_. X vjvjv_ v_ X ^VJL.rtV^, X X UUuv., X uLnL V-rt X X vjvj 1 L 1 v_.rtrtrt. 




Qy 


2341 


GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 


2400 






I 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2341 


GATTCAGTCACAGCTGTTGTTTTPGTGGPATTTGGPAPPTPTOTPPPAGATAPGTTTGPP 

VJ/T. X X V-llU X V_, /T- V_- r~L\J V_ X VJ X X VJ X X X X V_. VJ X VJ VJ V_rt. XXX VJVJv-.rt.V- V- X \ — X VJ X V_» V_, V_. rt.VJ.rt X .rt.V_» VJ XXX VJVw V- 


94 00 


Qy 


2401 


AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 


2460 






I 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 II 1 1 1 1 1 1 

II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2401 


AGCAAAGCTGCTGCCCTCCAGGATGTATATGPAGAPGPPTPPATTGnPAAPGTGAPnGGP 

r^\jv«nnn\j v- x. vj v- j. vjvvv x vvnuun x vj x x x vJViivJfiw VJ v_» v_# x v_- V^*l x x vjvj \^rir\^ VJ x VJ-iV»> VJVJVJ V~ 


9460 

Z *± vj Vj 


Qy 


2461 


AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 


2520 






I 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2461 


AGCAAPGCCGTPAATGTPTTPPTGGGPATPnGPPTPGPPTrJGTPPGTGGPPnPPZVTPTAP 

*i-vj \~.r\r\\ — vjv_ v — vj x > — r^n. x vj x v_ x ill. i vjvjvj v».rt. x \_.vjvjw w X vjvj v_ v_ X vjvj lUvUl uOv>LVJLvii XV X rt.v_. 


9 R9 n 

Z D Z U 


Qy 


2521 


TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 


2580 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 II 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2521 


TGGGPTPTGPAGGGAPAGGAGTTPPAPGTnTPGnPPGGPAPAPTPnPPTTPTPPnTPZiPP 

x vj vj VJ L x v_» x vjv_.rtVjvjvj.rtV_.rt.vjvj.rtVJ X X v, v_.rt.v_.vj X VJ X V_vjVJV_ V_.vJvJV_.rtv_.rtv_. X VJVJ v_ V_, X X v_. X v_. v_vj X v_rtv_ v_ 


Z joU 


Qy 


2581 


CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 


2640 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 1 I I I I t I I I I I I I I I 
1 1 II II 1 1 1 1 1 II 1 1 1 1 1 II II II 1 II II 1 II 1 1 1 1 1 II 1 II 1 1 1 II II II 1 1 1 1 1 II II 




Db 


2581 


PTPTTPAPPATPTTTPPATTTOTPTPPATPAPPPTHPTPTTnTAPPnA APHPHflPPPPAP 

v— X V_ X X V_rtV_ V_rt x V_ X X X VjV_rt X X X VJ X v_ X VJ v_rt X v_rtvjv_vj X \JL X v_ X X vj -L rtv_, v_.vjrtrtVJVJ v_.V_jvjv_. vjv_.rt.v_. 


z o*± u 


Qy 


2641 


CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 


2700 






1 1 1 1 1 1 1 1 1 1 1 II 1 1 II II 1 ll 1 1 M 1 i 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 Ml 1 1 II III 1 




Db 


2641 


CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 


2700 


Qy 


2701 


AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 


2760 






1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2701 


AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 


2760 


Qy 


2761 


TTCTAA 2766 




Db 


2761 


Mill 

TTCTAA 2 766 





RESULT 4 
ADH42302 



ID ADH42302 standard; DNA; 2828 BP. 
XX 

AC ADH423 02; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Novel human nucleic acid NOV55a. 
XX 

KW ds; gene; cardiovascular; antiarteriosclerotic ; hypotensive; cytostatic; 

KW anorectic; antidiabetic; immunosuppressive; ant i -HIV; neuroprotective; 

KW nootropic; antiparkinsonian; antiasthmatic; antiinf ertility; 



KW 


cardiomyopathy; atherosclerosis; hypertension; cancer; obesity; diabetes 


KW 


AIDS; multiple sclerosis; graf t-versus-host disease; Alzheimer's disease 


KW 


Parkinson' s 


disease; asthma; fertility disorder; chromosome mapping; 


KW 


tissue typing; preventive medicine; pharmacogenomic ; vaccine. 


XX 






OS 


Homo sapiens . 


XX 






PN 


WO2003102159-A2 . 


XX 






PD 


ll-DEC-2003 




XX 






PF 


04 -JUN-2 003 


• 2 003WO-US01 7R73 


XX 






PR 


04 -JUN-2 002 


• 2002US- 03 85490P 


PR 


04 -JUN-2 002 


• 2002US- 03 RS61 RP 


PR 


04 -JUN-2 002 


• 2002US- 03 85755P 


PR 


05 -JUN-2 002 


♦ 2002US- 03 86041 P 


PR 


06 -JUN-2 002 


• 2002UQ- 03 863 RRP 


PR 


06 -JUN-2 002 


• 2 0 02US- 03863S7P 


PR 


06 -JUN-2 002 


♦ 2002U9- 03 86447P 


PR 


0 6 - JUN - 2 0 0 2 


• 2 0 02UQ- 03R64RQP 

r Z V V Z VJ O OUT J . 


PR 


0 6 - JTTN-9 00 9 


• 9009TT^- 0^ R64.6RP 


PR 


0 6 - ,tttn- ?nn? 

WD U U IN 4^ uuz 


9009TTQ- 0^ QCQCAD 


PR 


07 -JUN-2 00 2 


9 0 09TT C ? - 0 3 R67 0 1 P 


PR 


0 7 -.TTTN-9009 


• 9009TT^- 0^ R67Q6P 


PR 


07 - .TUN- 2 00 2 


* 9009TT9- 03 R6Q71 P 


PR 


07- JUN-2002 


• 9009TT^- 03 R7078P 

r Z \J W Z O U JO / u / or . 


PR 


07 - JTTN-9 0 09 


• 9 009TTC3- 07 R70ft 1 P 


PR 


07 -JTTN-9 00 9 


• 9009TT^- 03 P70R7P 
r z u \j z u o u j o / uo or . 


PR 


1 0 - JTTN- 9 009 

X \J U U1N Z \J \J Z 


• 9 0 09TTC? - 0 7 P. 7 A 9 QP 


PR 


1 0 -JTTN- 9 009 

X U U U IX Z U \J Z 


9 0 09TTQ - 0 7 R 7 R A OP 
z u u z U O U J O / j'lUr . 


PR 


1 0 - .TTTN - 9 0 0 9 


9 009TTQ- 07 


PR 


1 1 -JTTN-9 0 02 

X J- U U1M Z \J \J Z 


• 9 009TTC?- 07 R7fi0fiP 

Z U U Z U O V O O / DUD±r» 


PR 


1 1 -.TTTN- 9 009 


9 0 09TTQ - 07 fl 7^ 1 OD 


PR 


1 1 -.TTTN- 9 0 0 9 

XX U \JVi Z U U Z 


9 O 0 9TTC - 07W7^t^Q"D 


PR 


1 1 -.TTTN- 9 00 9 

XX UVJ1M Z U U Z 


9 O 0 9 TTQ - 0 7 R 7 ^ £ P. TD 


PR 


1 1 -.TTTN-9 009 

XX UUIM Z U U Z 


9 0 0 9 TTQ - 07R"7£Q£D 
ZUUZUO UjO / 070r. 


PR 


1 1 -.TTTN- 9009 

XX U U1M Z U VJ Z 


• 9 0 09TTQ - 0 7 R 7 R R QP 
ZUUZUO UJO / OD3r . 


PR 


T 2 - .TUN - 9 0 0 9 

X Z U U1M Z W W Z 


• 9 0 09TTQ - 0 7 R 7 Q7 4 P 


PR 


1 9 - .TTTN- 9 0 0 9 

X Z U U1M Z U U Z 


9 0 09TTQ - 0 7 R 7 Q£ OP 
ZUWZUo UjO / jOUr . 


PR 


1 9 -.TTTN-9 009 

X Z U UJ.N Z U U Z 


• 9 0 09TTQ - 07 fl R09 9 P 


PR 


1 9 -.TTTN-9 009 

X Z U U1M Z U U Z 


9 0 09TTQ - 0 7 fl R O Q^D 
ZUUZUo U j00U70r . 


PR 


n 9 -..TTTN-9 0 09 

XZ U U1N Z \J U Z 


9 O 09TTQ - 0 7 P R A 7 0 D 


PR 


12-JUN-2002, 


► 2002US-0388479P. 


PR 

XT IN. 


13-JUN-2002, 


2002US-0389123P. 


PR 


14-JUN-2002, 


2002US-0389120P. 


PR 


14- JUN-2002 , 


2002US-0389146P. 


PR 


17-JUN-2002, 


2002US-0389742P. 


PR 


18-JUN-2002, 


2002US-0389604P. 


PR 


18- JUN-2002 , 


2002US-0389884P. 


PR 


19-JUN-2002 , 


2002US-0390006P. 


PR 


19-JUN-2002 , 


2002US-0390144P. 
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XX 

DR WPI; 2004-053467/05. 

DR P-PSDB; ADH42303 . 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, cardiomyopathy, 

PT atherosclerosis or diabetes, in chromosome mapping, tissue typing or in 

PT pharmacogenomics . 

XX 

PS Claim 20; SEQ ID NO 855; 1503pp; English. 
XX 

CC The invention relates to 566 new isolated human polypeptides and their 

CC encoding genes, sequences that are at least 95% identical to these or 

CC sequences comprising one or more conservative substitutions in these. The 

CC polypeptide, polynucleotide and antibodies against the polypeptides are 

CC useful in diagnosing, treating or preventing NOVX-associated disorders, 

CC e.g. cardiomyopathy, atherosclerosis, hypertension, cancer, obesity, 

CC diabetes, AIDS, multiple sclerosis, graf t-versus -host disease, 

CC Alzheimer's disease, Parkinson's disease, asthma, or fertility disorders. 

CC The nucleic acids are further used as hybridization probes, in chromosome 

CC mapping, tissue typing, preventive medicine, and pharmacogenomics. The 

CC polypeptides are also useful as vaccines. This sequence represents an 

CC example of the nucleic acid sequence of the invention. 

XX 

SQ Sequence 2828 BP; 670 A; 692 C; 778 G; 688 T; 0 U; 0 Other; 



Query Match 99.9%; Score 2764.4; DB 12; Length 2828; 

Best Local Similarity- 100.0%; Pred. No. 0; 

Matches 2765; Conservative 0; Mismatches 1; Indels 0; Gaps 



0 



Qy 


1 


Db 


63 


Qy 


61 


Db 


123 


Qy 


121 


Db 


183 


Qy 


181 


Db 


243 


Qy 


241 


Db 


303 


Qy 


301 


Db 


363 


Qy 


361 


Db 


423 


Qy 


421 


Db 


483 


Qy 


481 


Db 


543 


Qy 


541 


Db 


603 


Qy 


601 


Db 


663 


Qy 


661 


Db 


723 


Qy 


721 


Db 


783 


Qy 


781 


Db 


843 


Qy 


841 



ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 6 0 

I III 1 1 1 III I MM I II 1 1 1 MM I III I M 1 1 1 M : 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 122 
TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I MM Mill I Mill MM MIMMIII I MM I Mill MM I III lllllll II 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 182 
ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 180 

1 1 M M M M 1 1 II II III 1 1 1 II II IMIII I III 1 1 II II II II I III 1 1 III II III 

ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 242 
CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 

M MM Mill I Mill MM MIMMIII Mill I Mill MM I MIMMIII II 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 3 02 
TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 3 00 

I MM Mill I Mill MM Mill IMIII Mill MM IMIII MIMMIII Ml 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 3 62 
TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 3 60 

MIIMIMIIIIIIIIIIIIIIIIIIIIMIIIIMMIMIIIMIMIIMIIIIII 

TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 422 
ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 42 0 

M MM I II III Mill MM MIMMIII II MM MM Mill MIMMIMI II 

ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 482 
CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 4 80 

II IIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIMIIMIMIIMIIMIII 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 542 
ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 54 0 

1 1 1 1 II I M 1 1 1 1 1 1 II II II 1 1 1 1 1 Mill II II II 1 1 1 1 II II II I III 1 1 II I MM 

ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 602 
ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 600 

1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 II ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAACATCTA 662 
CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

Mill II 1 1 1 1 1 1 II 1 1 1 1 1 1 II Mill I II II 1 1| || Ml II Mill I II II MM I II 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 722 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 7 82 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1| 1 1 1 1| 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 842 
ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 840 

II 1 1 1 1 1 1 1 1 1 1 M II 1 1 M M 1 1 1 1 II 1 1 1 1 II II II 1 1 II MM 1 1 II II I II 1 1 1 

ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 902 
841 CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 900 



Ill MM ; 1 1 1 111 i MM il lllllllll I INI MIIIMIIIIIIMIIIII 

CACCCTAAGGGCATTGAGATGGATGGGT^AAATGATGAATTCCCATTTTCTAGATGGGAAC 962 



Db 


903 


C ACC CTAAGGG C ATTG AG ATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 


962 


Qy 


901 


CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 


960 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 II I i M M 1 1 1 11 1 1 II II M II II II li II 1 II 1 1 II II 1 1 II 1 II II II II II 1 1 II 




Db 


963 


CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 


1022 


Qy 


961 


AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 


1020 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M 1 1 II 1 II 1 1 II 1 1 1 1 II II 1 1 1 II II II 1 II 1 1 II 1 1 II 1 1 II 1 1 1 1 1 II 1 1 1 II II 1 




Db 


1023 


AAGGATCTGAAGC AAAAACACCCAGAGAAGGACTTAGATP AG PTGGTGG AG ATfiOP P A A T 


X \J O £t 


Qy 


1021 


TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 


1080 






1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 II II 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 I 1 I I 




Db 


1083 


TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 


1142 


Qy 


1081 


ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 


1140 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I 




Db 


1143 


ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGGrAARAAnnrr 


1 909 


Qy 


1141 


TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 


1200 






1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 II II 1 1 1 1 1 II II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1203 


TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTGTTT 


1262 


Qy 


1201 


GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 


1260 






1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 II II 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1263 


GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 


1322 


Qy 


1261 


AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 


1320 






1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II 1 




Db 


1323 


AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTAGAAAACAGAGGATGGTTrTnrr 


X J O £t 


Qy 


1321 


AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 


1380 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 




Db 


1383 


AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 


1442 


Qy 


1381 


CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 


1440 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 
1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 




Db 


1443 


CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATOAArAPTTP 


1 ^09 

-L -D \J 


Qy 


1441 


TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 


1500 






I 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II 1 1 1 1 1 1 1 1 II II II 1 1 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 II 1 1 II II II 1 II 1 II II 1 II 1 II 




Db 


1503 


TTTGTAAGGTTGAGCAATGTCCGGATAGAGfiARnAnPAnrrAnAf5nAnnfir;ATnr , rTPrA 




Qy 


1501 


GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 


1560 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 
1 1 1 1 II 1 II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


1563 


GPAATATTPAACAGTPTTPPPTTPPPTPPnnPTnTPPTAfiPPTPPPPTTPTnTnnPPAPA 




Qy 


1561 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1620 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 




Db 


1623 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1682 


Qy 


1621 


GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 


1680 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 MM! 1 1 1 1 1 MINI 1 II 1 1 II 1 1! II 1 III Mill 1 1 1 1 




Db 


1683 


GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 


1742 


Qy 


1681 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


1740 



III MM Mill I Illlllll llllllllll Mill MIIMMMIMMIMMMII 



Db 


1743 


Qy 


1741 


Db 


1803 


Qy 


1801 


Db 


1863 


Qy 


1861 


Db 


1923 


Qy 


1921 


Db 


1983 


Qy 


1981 


Db 


2043 


Qy 


2041 


Db 


2103 


Qy 


2101 


Db 


2163 


Qy 


2161 


Db 


2223 


Qy 


2221 


Db 


2283 


Qy 


2281 


Db 


2343 


Qy 


2341 


Db 


2403 


Qy 


2401 


Db 


2463 


Qy 


2461 


Db 


2523 


Qy 


2521 


Db 


2583 



1743 ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 1802 
GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 1800 

MM! I Ml! I III 1 1! 1 1! 1 1 1 1 II 1 1 1 1 II I MM I III I llllllllll I MM Ml 

GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 1862 
AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 1860 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 1922 
CCGAAATGGATGGAACGTGGAATATCAGATGTGAC AGACAGGAAGCTGACTATGGAAGAA 192 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 

CCGAAATGGATGGAACGTGGAATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 1982 
GAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTGGGTGAACACCCCAAACTA 1980 

1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 

GAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTGGGTGAACACCCCAAACTA 2 042 

GAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 2 04 0 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I I I I I I 
GAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 2102 

ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 2100 

1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 

ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 2162 
ACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCC 2160 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCC 2222 
TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 222 0 

Ml Mill I MM I II! II III I MM I MM I Mil 1 1 II MIMI MM 1 1 III 1 1 

TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 2282 
CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 2280 

M f 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I 

CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 2 342 
ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 2 340 

II 1 1 1 1 1 1 1 1 II Ml 1 1 1 II MM I Mill II 1 1 1 1 II 1 1 II III III II II I Ml 1 1 II 

ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 2402 
GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 2400 

I ! 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 

GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 2462 
AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 2460 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f i r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 r 1 1 1 1 1 

AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 2 522 
AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 2 52 0 

MM II II II II I II MM Mill II I MMIIIIMI I MM Mill Mill IIMIII 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 2582 
TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2 580 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGGGCTCTGCAGGGAC AGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2642 



Qy 2 581 CTCTTC ACC ATCTTTGC ATTTGTCTGCATC AGCGTGCTCTTGTACCGAAGGCGGCCGCAC 2640 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I II I I I I I 
Db 2 64 3 CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 2702 



Qy 2 641 CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 2700 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I I 
Db 2 703 CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 2762 

Qy 2 701 AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2760 

I Ml I I I I I I I I I II I I I III II I I I I MMII II I I II I I II I I I I II I I I III I I II I 
Db 2763 AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2 822 

Qy 2761 TTCTAA 2766 

I I I I I I 

Db 2823 TTCTAA 2828 



RESULT 5 
ABZ33735 

ID ABZ33735 standard; CDNA; 2966 BP. 
XX 

AC ABZ33735; 
XX 

DT 30-JAN-2003 (first entry) 
XX 

DE Human TRICH encoding cDNA SEQ ID NO 41. 
XX 

KW Human; TRICH; transporter and ion channel; transport disorder; 

KW cystic fibrosis; diabetes mellitus; Parkinson's disease; cancer; 

KW neurological disorder; Alzheimer's disease; Huntington's disease; 

KW immunological disorder; AIDS; asthma; cell proliferative disorder; 

KW transgenic; gene therapy; neuroprotective; antidiabetic; cytostatic; 

KW antiparkinsonian; hypotensive; nootropic; antianaemic; anticonvulsant; 

KW cerebroprotective; cardiant; ant i -HIV; human immunodeficiency virus; 

KW antiasthmatic ; ant iatherosclerotic ; ant igout ; antiarteriosclerot ic ; 

KW hepatotropic; antiinflammatory; virucide; cytostatic; gene; ss. 
XX 
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XX 
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XX 
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XX 
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DR P-PSDB; ABP74104. 
XX 

PT Novel human transporter and ion channel polypeptide, useful in diagnosis, 

PT prevention or treatment of transport, neurological, muscle, immunological 

PT and cell proliferative disorders. 
XX 

PS Claim 96/ SEQ ID NO 41; 146pp + Sequence Listing; English. 
XX 

CC The invention relates to human transporter and ion channel polypeptide 

CC (TRICH) (I) selected from one of 32 polypeptide sequences (ABP74096- 

CC ABP74127) , a naturally occurring polypeptide comprising a sequence having 

CC at least sequence 90 % identity to (I) or a biologically active or 

CC immunogenic fragment of (I) . (I) is useful for screening a compound for 

CC effectiveness as an agonist or antagonist, for screening a compound that 

CC specifically binds (I) or modulates the activity of (I) and for preparing 

CC a polyclonal or monoclonal antibody by hybridoma technology. 

CC Polynucleotides (II, ABZ33727 -ABZ33758) encoding (I) are useful for 

CC screening a compound altering gene expression. (I) and (II) are useful in 

CC a diagnostic tests for a condition or a disease associated with the 

CC expression of TRICH in a biological sample, especially disorders selected 

CC from a transport disorder such as cystic fibrosis, diabetes mellitus, 

CC Parkinson's disease, cardiac disorders, neurological disorders such as 

CC Alzheimer's disease, Huntington's disease, muscle disorders, 

CC immunological disorder such as AIDS, asthma and atherosclerosis, and cell 

CC proliferative disorder such as arteriosclerosis, cirrhosis, hepatitis and 

CC cancer. (II) is useful for creating knock-in humanised animals or 

CC transgenic animals to model human diseases, in somatic or germline gene 

CC therapy, to generate a transcript image of a tissue or cell type, for 

CC detecting differences in the chromosomal location due to translocation, 

CC inversion among normal, carrier or affected individuals and for mapping 

CC genomic sequences. Note: The sequence data for this patent is not 

CC represented in the printed specification but is based on sequence 

CC information supplied to Derwent by the European Patent Office 

XX 

SQ Sequence 2966 BP; 692 A; 725 C; 809 G; 740 T; 0 U; 0 Other; 

Query Match 99.9%; Score 2764.4; DB 6; Length 2966; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2765; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 
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TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 



120 



Db 
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320 
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ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 



180 
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561 
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741 
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801 


Qy 


661 
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861 
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721 
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921 
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781 


Db 


981 


Qy 


841 


Db 


1041 


Qy 


901 


Db 


1101 


Qy 


961 


Db 


1161 



CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 24 0 

I III II Mill IIIIMIIIIIIIIIMIIIMIII II MM III I MM llllllllll 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 44 0 
TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 500 
TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 3 60 

1 1 1 1 II 1 1 II I II I II 1 1 III 1 1 II 1 1 1 II II 1 1 1 1 II II I II I III 1 1 III 1 1 MM II 

TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 560 
ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 420 

1 1 J 1 1 1 M 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 ] 1 1 1 1 j I 

ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 620 
CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 4 80 

I Ml 1 1 1 1 1 II 1 1 1 1 1 II 1 1 II 1 1 II II II II MM II II 1 1 1 1 1 1 II 1 1 Ml I II II II 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 680 

ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 740 

ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 600 

I M I M : 1 1 I M M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 800 
CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 III 1 1 1 1 1 1 Ml 1 1 1 M 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 860 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 92 0 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 78 0 

I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | | | | | | | | | | 
TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 980 

ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 104 0 

CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 900 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i E 1 1 1 1 1 1 1 1 1 1 1 1 

CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 1100 
CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 960 

1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 

CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 1160 

AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 1020 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 122 0 
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Db 


1821 


Qy 


1681 


Db 


1881 


Qy 


1741 




J. J *± -L 


Qy 


1801 


Db 


2001 


Qy 


1861 



TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGTGCCTTCTACCGTATCCAAGCCACTCGT 12 80 
ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 1140 

1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II I II 1 1 1 1 1 1 1 II I M 1 1 1 1 II II 

ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 134 0 
TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 12 00 

II INI 1 1 II 1 1 II II I III 1 1 III I II MM I II 1 1 III I III II MM I Mil I II II 

TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 14 00 
GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 12 60 

1 1 1 1 M 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 14 60 
AAAGGGGGAGACATGTC AAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 1320 

MMIMMMMMMMMMMMMMMMMMMMMMMMMMMMI 

AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 1520 
AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 13 80 

MMMMMMMMMMMMMMMIM MMMMMMMMMMMMMI 

AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 15 8 0 
CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGAC ATTTTTGAGGAGGATGAAC ACTTC 144 0 

I II 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 Ml 1 1 II II 1 1 1 1 1 II I II 1 1 1 II 1 1 1 1 1 

CAGAAGGAGTTCTCCGTGGGC AT AATTGATGACGACATTTTTGAGGAGGATGAAC ACTTC 164 0 

TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 1500 

I I I I II III I II I I M I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 1700 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1760 

GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 162 0 

I I I I I I I I I I I II I I I I I I I II I I I I I I I I M I I I I I I II I I I I II I I M I I I II I I I I I 
GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 182 0 

GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 168 0 

1 1 II I Mill 1 1 1 1 1 II 1 1 MM I II 1 1 1 II II II MM III II 1 1 1 1 II Ml I III I M 

GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 188 0 
ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 174 0 

II I MM II I II I M II MM II IMIMIII Ml II 1 1 Ml III 1 1 II MM II III II 

ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 194 0 
GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 1800 

I II I M 1 1 II 1 1 II II II I II II II M 1 1 II II M 1 1 II I II I II I M II 1 1 1 II II M I 

GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 2 000 
AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 1860 

I II II MM I II II III II II II II I II M Ml II 1 1 MM II 1 1 1 MM II 1 1 1 III I 

AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 2 060 
1861 CCGAAATGGATGGAACGTGGAATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 192 0 



Illlllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

CCGAAATGGATGGAACGTGGAATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 2120 



Db 


2061 


CCGAAATGGATGGAACGTGGAATATCAGATnTnArAnArAfiRZXAfiPTnArTATnOAAGAA 

v- v- unr\n x x vj vjrviv vj x vj vjxt-t^ x xi. x v-xTVjl"V X VJ X VJnLnUrlV^rVVJunnwVtf X vJ.rt.V_ X rt. x VjVjrtrtvJrtrt 


9 i 90 

x- X -C \J 


Qy 


1921 


GAGGAGGCCAAGAGG ATAGCAGAGATGGGAAAGC CAGTATTGGGTGAACACC C CAAACTA 


1980 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II M M II II II II 1 1 1 II 1 1 II II II II M II 1 1 1 II II II II II II II II 1 1 1 1 1 I 1 1 




Db 


2121 


GAGGAGGPPAAGAGGATAGPAGAGATPPP A A APPPAPTATTPPPTP A APAPPPP A A APT A 

v— 'rtvJvJrtvJvJv- v-rtrtvjrtvjvjrt x ri\Jv.rt\Jri\jrv x VJVjVJxn_r-_f-->Vjv_ V_rt.vj X rt X JL vjvjvj X unnLnw v. L CnnnL x rt 


91 on 
z x o u 


Qy 


1981 


GAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 


2040 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II 1 1 1 1 II 1 1 II 1 1 II 1 1 1 II II 1 II 1 II 1 1 1 1 1 II II II II II 1 1 1 1 1 II 1 II 1 1 1 1 II 




Db 


2181 


GAAGTCATCATTGAAGAGTCCTATGAGTTrAAnArTArnnTnRArAAArTnATPAAnAAn 

vjx-j-tivj x v-xt x v-x-_> x x vjr-ix--.vjx-i.vj x v_ v_ ini x X v-rtrtvjrt v- X rt v- VJVJ X VJOrtV_rt^Artv- X VJrt X v^rtrtOrtrtvJ 


£, Z f± U 


Qy 


2041 


ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 


2100 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2241 


ACAAACCTGGCCTTGGTTGTGGGGACrrATTPrTnGARRRArrAnTTrATrinAnnPPATP 


9 ~k n n 


Qy 


2101 


ACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCC 


2160 






1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2301 


ACCGTCAGTGCAGPAGGGGATGAGGATGAnnATGAATPPPPnPAnPTAnAPnPTPPPPTPP 

- r * v — v^ vj x v_-.fi. v_j x vj v_ rv\j v_ xt.vj vj vj vjxt. x \Jn\J\Jr\ X VJrt,vjvjrt. X VJrtrt. X V_ v-VJVjvjVjrt,VjVjrt.VJrt.VjVjV_ X Vj v— V_ v- X v- \_ 


9 7 n 

Z 0 0 u 


Qy 


2161 


TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 


2220 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 II 1 1 
1 1 II 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 




Db 


2361 


TGCTTTGACTACGTCATGCACTTPPTGAPTGTPTTPTnnAAGnTr.PTfJTTTP»PPTPTTnTn 

VJ V- x. J. J. j. J- v-xT. J. VJ V— fXV— X X V* V- X VJ-T-.V— X VJ X V— X X V— X VjvJxT.rt.VjVJ X VJV> X VJ XXX VJ V_ v_ X VJ X VJ X VJ 


9-1 9 n 

Z r± Z U 


Qy 


2221 


CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 


2280 






1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 II 1 1 1 1 1 
M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


2421 


CCCCCPAPAGAGTAPTGPPAPGGPTPTPPrPPTpTpTTPPPPnTPTPPATPPTPATPATTPPP 

V-v-v- v- v-v-.i-lv-.x-ivJrt.vJ XrtV_ X VJV_ v-.rtV_VJVJV_ X VJVJVJV_V_ X VJV_ X .Lv-VJV^V_vjXV_ X V-v-rt X V, V- ± v-rtX v-rt X 1 vJVJV_ 


Z*fc 0 u 


Qy 


2281 


ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 


2340 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2481 


ATnPTPAPPPPPATPATTPf5PPAPPTPfiPPTPnpaPT , T , pr;pPTPPappaT , T , r , r , Tr ,r pr'a 2\ a 

rtX VJV_ X V_rt.V_V_VJV_V_.rtX V_rt X lUvjOOi-iLL X VJvjVw v- 1 LbL/iV, X 1 V_,vJVJ v_ 1 VJV_rtV_L..rt 1 X ub 1L1 v^rtrtrt 


O C A C\ 
Z U 


Qy 


2341 


GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 


2400 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2541 


GATTCAGTCACAGPTGTTGTTTTPGTGnPATTTnnPAPPTPTP»TPPPiiP,ZVTAPPTTTnPP 

vJ** J- - 1 - v_.iT.VJ x v_x-J.v-x-i.vJv-. X VJ X X VJ X X X X V_VJ X VJ VJ V_rt X X X VJVJV_.rt.v-V_ ILlul V_ V- v-rtVjrt X rt v- Vj XXX uLL 


z 0 u u 


Qy 


2401 


AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 


2460 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 
1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 II 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I | I | | | 




Db 


2601 


AGPAAAPPTPPTPPPPTPPAPPATPTATATPP AP APPPPT'PP.iT'T'PPPZi T±r > r* r vr*7ir'r t nHn 

rtvJv-rtrtrtVJV_ X VJV_ X VJV_V_V_ X V_V_rt.VJVJrt.lVj Xrt Xrt X vJv_rtvJrtV_VJV_V_ X V_ V_rt 1 X VJvJ v-rtrtV- vJ X vJrtv-VJvJvJv- 


zobU 


Qy 


2461 


AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 


2520 






1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l l l l l 
1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 




Db 


2661 


AGCAACGPPGTPAATGTPTTPPTPPPPATPPPPPTPPPPTnnTPPPTPPPPPPPATPTAP 

xTvJv-x-i_r»>v-\Jv- v-VJ X \_rtrt X VJ X V_ X X V_ V_ X VJVJVJ V_rt X V_VJVJV_V_ X VjVjV_V_ X VJ Vj X v-v-vJ X VjVJv-V_VJv-v^rtX v- 1 xTV_ 


z / z u 


Qy 


2521 


TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 


2580 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 
M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2721 


TGnGPTPTPPAPPPAPAPPAPTTPPAPPTPTPPPPPPPPAPAPTPPPP r P r PPT , PPP r rpar , r 1 

vjvjvJv- X v- X VJv-rtvJvJvJrtv-rt.vJVJrt.VJ X X V_ V_rtV_Vj X VJ X V_\JVJV_ V_ VJVJ V_rt,V_rtV_ X VJVJV_V_ X X V_ X V_V_vj X v-rtv- v- 


z / 0 U 


Qy 


2581 


CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 


2640 






I 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II M 1 1 1 1 ! 1 1 II II 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2781 


CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 


2840 


Qy 


2641 


CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 


2700 






1 1 1 1 III II 1 1 1 II 1 III II III llllllllll Ml II II III 1 II II MM 1 IMIII 1 




Db 


2841 


CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 


2900 


Qy 


2701 


AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 


2760 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIII 



Db 2 901 AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2 960 



Qy 2761 TTCTAA 2766 

MINI 

Db 2961 TTCTAA 2966 

RESULT 6 
ADH4 2 314 

ID ADH42314 standard; DNA; 2778 BP. 
XX 

AC ADH42 314; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Novel human nucleic acid NOV55g. 
XX 

KW ds; gene; cardiovascular; antiarteriosclerotic ; hypotensive; cytostatic; 

KW anorectic; antidiabetic; immunosuppressive; anti-HIV; neuroprotective; 

KW nootropic; antiparkinsonian; antiasthmatic; antiinf ertility ; 

KW cardiomyopathy; atherosclerosis; hypertension; cancer; obesity; diabetes; 

KW AIDS; multiple sclerosis; graf t-versus -host disease; Alzheimer's disease; 

KW Parkinson's disease; asthma; fertility disorder; chromosome mapping; 

KW tissue typing; preventive medicine; pharmacogenomic ; vaccine. 
XX 

OS Homo sapiens. 
XX 

PN WO2003102159-A2 . 
XX 

PD ll-DEC-2003. 

XX 

PF 04-JUN-2003; 2 003WO-US0 17573 . 
XX 

PR 04-JUN-2002; 2 0 02US - 03 854 90P . 

PR 04-JUN-2002; 2 002US - 03 85615P . 

PR 04-JUN-2002; 2 002US - 03 85755P . 

PR 05-JUN-2002; 2002US- 0386041P . 

PR 06-JUN-2002; 2 002US - 03 863 55P . 

PR 06-JUN-2002; 2 002US - 03 86357P . 

PR 06-JUN-2002; 2002US-03 86447P . 

PR 06-JUN-2002; 2 002US - 03 86459P . 

PR 06-JUN-2002; 2 002US - 03 86465P . 

PR 06-JUN-2002; 2 002US - 03 86864P . . * 

PR 07-JUN-2002; 2002US- 03 86701P . 

PR 07-JUN-2002; 2 002US - 03 86796P . 

PR 07-JUN-2002; 2 002US - 03 8693 IP . 

PR 07-JUN-2002; 2 002US-03 87078P . 

PR 07-JUN-2002; 2002US-0387081P. 

PR 07-JUN-2002; 2 002US - 03 87083P . 

PR 10-JUN-2002; 2 002US- 03 8742 9P . 

PR 10-JUN-2002-; 2002US-0387540P . 

PR 10-JUN-2002; 2 002US - 03 87 866P . 

PR ll-JUN-2002; 2002US-0387606P . 

PR ll-JUN-2002; 2 002US - 03 87610P . 

PR ll-JUN-2002; 2002US-0387659P. 

PR ll-JUN-2002; 2 002US - 03 87668P . 

PR ll-JUN-2002; 2 002US - 03 87696P . 



PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
PI 
PI 
PI 
PI 
PI 
PI 
PI 
PI 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
PT 
PT 
XX 
PS 



11- JUN- 

12- JUN- 
12-JUN- 
12-JUN- 
12-JUN- 
12-JUN- 

12- JUN- 

13- JUN- 

14- JUN- 

14- JUN- 

17- JUN- 

18- JUN- 

18- JUN- 

19- JUN- 
19-JUN- 
19-JUN- 

25- JUN- 
06-AUG- 
09-AUG- 

12- AUG- 

13- AUG- 

15- AUG- 
15-AUG- 

26- AUG- 

12- SEP- 

13- SEP- 
23-SEP- 

30- SEP- 
23-OCT- 

23- OCT- 

24 - OCT - 

31- OCT- 
01 -NOV - 
05 -NOV - 



2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 
2002 



2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 
2002US- 



0387859P. 
0387934P. 
0387960P. 
0388022P. 
0388096P. 
0388432P. 
0388479P. 
0389123P. 
0389120P. 
0389146P. 
0389742P. 
0389604P. 
0389884P. 
0390006P. 
0390144P. 
0390209P. 
0391726P. 
0401628P. 
0402268P. 
0402822P. 
0403458P. 
0403617P. 
0403732P. 
0406182P. 
0410085P. 
0410505P. 
0412955P. 
0415195P. 
0420627P. 
0420718P. 
0420852P. 
0422750P. 
0423095P. 
0423748P. 



(CURA-) CURAGEN CORP. 

Alsobrook JP, Anderson DW, Baumgartner JC, Berghs C, Boldog FL; 
Burgess CE, Casman SJ, Catterton E, Dhanabal M, Edinger SR; 
Ellerman K, Ettenberg S, Gangolli EA, Gerlach VL, Gorman L; 
Grosse WM, Gunther E # Guo X, Gusev VY, Herrmann JL, Ji W, Kekuda R; 
Khramtsov NV, Larochelle WJ, Li L, Liang H, Low K, Macdougall JR; 
Maclachlan T, Malyankar UM, Mcqueeney K, Mezick AJ, Miller CE; 
Millet I, Padigaru M # Patturajan M, Peyman JA, Qian X, Rastelli L; 
Rieger DK # Rothenberg ME, Shenoy SG, Shimkets RA, Smithson G; 
Spytek KA, Stone DJ, Sukumaran S, Szekeres ES, Vernet CAM, Voss EZ; 
Wolenc AR, Zhong M, Zhong H; 

WPI; 2004-053467/05. 
P-PSDB; ADH42315. 

New NOVX polypeptides and nucleic acid molecules useful for preventing or 
treating NOVX-associated disorders, e.g. cancer, cardiomyopathy, 
atherosclerosis or diabetes, in chromosome mapping, tissue typing or in 
pharmacogenomics . 

Claim 20/ SEQ ID NO 867; 1503pp; English. 



XX 



CC The invention relates to 566 new isolated human polypeptides and their 

CC encoding genes, sequences that are at least 95% identical to these or 

CC sequences comprising one or more conservative substitutions in these. The 

CC polypeptide, polynucleotide and antibodies against the polypeptides are 

CC useful in diagnosing, treating or preventing NOVX-associated disorders, 

CC e.g. cardiomyopathy, atherosclerosis, hypertension, cancer, obesity, 

CC diabetes, AIDS, multiple sclerosis, graf t-versus-host disease, 

CC Alzheimer's disease, Parkinson's disease, asthma, or fertility disorders. 

CC The nucleic acids are further used as hybridization probes, in chromosome 

CC mapping, tissue typing, preventive medicine, and pharmacogenomics . The 

CC polypeptides are also useful as vaccines. This sequence represents an 

CC example of the nucleic acid sequence of the invention. 

XX 

SQ Sequence 2778 BP; 656 A; 684 C; 764 G; 674 T; 0 U; 0 Other; 



Query Match 99.9%; Score 2763; DB 12; Length 2778; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2763; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


i 


ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II M 1 1 1 II 1 II 1 III 1 1 1 1 


- 


JJD 


10 


ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 


69 


Qy 


61 


TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 


120 






II 1 1 MM 1 III 1 1 1 1 II llllll III 1 1 MM 1 1 1 1 Ml II II 1 1 1 II 1 II 1 Ml 1 1 1 1 




DD 


70 


TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 


129 


Qy 


121 


ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 


180 






MMMMMIIIMMMIIIMMMMIMM IMIIIEI MMMMMIMM 




JJD 


1 T A 

IjU 


ALAb(j(jCA(jAALAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 


189 


Qy 


181 


CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 


240 






1 1 Ml 1 M 1 1 1 1 1 1 II II 1 1 1 1 1 1 III 1 III 1 1 II 1 1 III II II II 1 1 1 MM 1 II MM 




Db 


190 


CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 


249 


Qy 


241 


TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 


300 


Db 


250 


MM M MM Mill llllllll! Mill Mill 1! MM Mill IIMIIIIIMIMI 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 


309 


Qy 


301 


TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 


360 






1 1 1 1 1 1 1 M 1 1 1 1 1 II II 1 1 II 1 1 1 1 1 1 III 1 1 1 II MIMI II 1 1 II 1 MM 1 II MM 




Db 


310 


TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 


369 


Qy 


361 


ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 


420 






MIMMMMIM MM Ml MIMIMI MM MM Ml MMMM 




Db 


370 


ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 


429 


Qy 


421 


CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 


480 






Ml 1 II 1 MM 1 1 1 1 1 Ml 1 1 1 1 1 1 M 1 1 II 1 1 1 II 1 Ml MM II 1 1 1 MM 1 II II 1 1 




Db 


430 


CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 


489 


Qy 


481 


ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I | | [ | | 




Db 


490 


ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 


549 


Qy 


541 


ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 


600 



Illlllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 609 



Db 


550 


X v_rl X. X OOv_ tt ILluiUlv- J- rt.v-VJ x \jr\ x v_v-.v_ttOttv_OOtt.Ott.Ott.v_ X v-uv-rirtOfi X v^rtttO v_tt X V_ X tt 


ouy 


Qy 


601 


CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


660 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 I 1 1 1 i 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 II M 1 II II II II II II II II II II II II II II 1 II 1 II 1 1 1 1 II II II M 1 II II 1 1 




Db 


610 


CGAGTPTTPTTPATPAPPGPTGPTTGRAnTATPTTTPPPTAPATPTn^lPTPTATATPATT 

v_vj.rt.o X V— X 1L1 x v_tt X V_rtv_ \_ vj V- x Ov_ X X \J\jr\\j Xtt X X X X \Jv.v< X/\\-r\ 1 v> 1 OOV_ X v_ X tt. X tt. X Ott 1 1 


ooy 


Qy 


661 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 I I I I I 
1 1 1 II 1 1 1 II 1 1 1 II 1 1 II 1 1 1 1 II 1 II 1 1 II 1 1 1 II II 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


670 


CTGGPAGTPTTPTPPPPTGGTnTnnTPPAGGTTTPPPA APPPPTPPTPaPTPTHTTPTTT' 

>-» x oov_ttO IV- 1 Xv_Xv_v_v_v_X OO X O X OO X v_v~ttOO XXX OUOririVJvJv-v. i CLl v.HL iLlLl 1 L 1 X v- 


/ Z y 


Qy 


721 


TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 


780 






I 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M l l l l l l l l I 

II 1 1 1 1 II 1 II II 1 II 1 1 1 II 1 II 1 1 1 II 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


730 


TTTPPAGTGTGTGTPPTTPTGGPPTG^GTGGPAPATAAAP^JAPTnPTPTTPTAPZil\l\Tl\r , 

xxx v_v_.«.vj xvjxvjxvjxv_v_x 1U1 VjvjV- v- XOOO X OO v_ttOtt XiAr-Vtt.v».vj.ttA». X UL 1L1 1L IrtLHnnlHv. 


TOO 


Qy 


781 


ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 II 1 1 II II 1 1 1 II 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 I I 




Db 


790 


ATGCACAAAAAGTAPCGPAPAGAPAAAPAPPGAGGAATTATPATAGAPAPAPAnnnTPAP 

**• x viv.nv-nnnnnu x rt v_ v^ vj v_rtv-rtVJrt v_.rtrtJ^v^.rtv^ v-OttOOtt-tt X X tt, X v_tt X ttOttOttv-ttOttOOO 1 unL 


o *t y 


Qy 


841 


CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 


900 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 M 1 1 1 M 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


850 


PAPPPTAAGGGPATTGAGATGGATGGGAAA ATGATGAATTPPPATTTTPTAP.ATnnnaar' 

v_riv_\_v_ x .ttJ-iooo v_tt x l unvJrt. l O urt 1 uOUnnnn i un. 1 unrt 1 ILLLrtl X X 1 L 1 rtVJrt 1 v3v3v3fin.L 


q n q 
y\)y 


Qy 


901 


CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 


960 






1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 M 1 1 1 1 1 1 M 1 1 1 II II 1 II 1 II 1 1 1 II 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


910 


PTGGTGPPPPTGGAAG^GAAG^AAGT^P.ATGA^TPPP^^Af5A^A^AT^l\T^ , ^ , ^i^'l\ ^ ^TPTr , 

V- X VJVJ X VJV- V- V_V_ X VJ \Jt-W~\.KJ\J VJrttt.VJ VJrtrtO X OOtt X VJ.tt.Vj X V_V_V_Ov_ttOttOttOtt.l Ottl Lv-vjoH, X Iv-IL 


yo y 


Qy 


961 


AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 


1020 






1 I I I I I I I I I I I I I I I I I I I I I i i i i i i i i i i i i i i i i ii i i i i i i i i i i i i i t i i i i i i 
1 1 1 II 1 1 1 1 II 1 1 II 1 II II 1 II II 1 II 1 II II 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


970 


AAGGATPTP A APPA A A A APAPPPAPZ1PA ZlPPZlP r rTA.nZi r nPZlPP r PPP r rpp apaTPrrra at 

tttt.VJVjtt.XV_ X vJtttt.vJv^tt-tt-ttttttv-ttv^v-.v-ttvjttvjrtJAvJOttv- 1 ltt.VJtt.XV_tt.VJV- 1 VJVJ 1 VJOttOtt.1 OOv_ v_tttt.l 


lUZy 


Qy 


1021 


TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 


1080 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I 1 l l l l l l l l I l 1 l I I I I I i i I I i I i I i i i i i i i i i 
1 1 1 1 1 1 1 II 1 II 1 1 ! 1 ! ! 1 1 II II II 1 II 1 II II 1 1 II 1 II II 1 II II 1 II II II 1 II II 




Db 


103 0 


TAPTATPPTPTTTPPP A PP 21 A PZ\PL\ SPAPPPPPPPPTTPTS fPPTSTPP A 7\ H r 1 /" 1 A PTPPT 
■L XttlOv- iv.ll 1 v_v_v^ttv_v_ttttv_ttortttottvjv_v_vjv_vjv_v_ 1 1 v_ lAv-Lvj 1 Al LLAAbLLAL 1 v_o 1 


1 U o y 


Qy 


1081 


ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 


1140 






1 II 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 l l l l l l l l l l I l I I I I I i I I I I 
1 M 1 1 1 II II 1 1 1 1 II 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 II 1 1 1 II 1 II II 1 1 II 1 




Db 


1090 


ATGATPtAPTPiPtTPtPAPtPPA AT ATPPTPZi AP A A L\PZ\TP.PZ\P.PaPzi z\ Pa a pppa apa r^nnnn 

"X vj/ix ortv_ x vjvj xvjv_tt.vjvjv_tttt.XttX V_v_ X OttttOrtrtttv_tt. X uLrtuLiluKrtLMHoLLn/iUf^uvjLL 


1 T A Q 

1 X4 y 


Qy 


1141 


TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 


1200 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 II 1 1 1 II II 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1150 


TrrZ\PPATPZ\nPPflPPTPP2\PZ\PPPZ\TPQPPPTPnPPZ\PTTTaTTTPP A nPfTPT'TPTTT 
X V_ v_ttvjv_tt X vjtt.vjV_vjtt.vjvJ 1 Ov_ttv_ttV_ V_Ott 1 ottov* v_ 1 vjtt.OvJtt.v_ X X XttX X X v_ v_tttt.vjvJ 1L1 It 1 1 1 


1 O Pi Q 

1Z U y 


Qy 


1201 


GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 


1260 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 l l l l l l l l l l l l l l l l I I I I I I I I I I I i i i I i i i 
I 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 




LJXJ 


i oin 


Vjtt.v_v_V_tt.XVj 1 1 v_ 1 Xtt.v_v_.tt.vj 1 Ov_v_ 1 vJvjtt.Otttt.v_ i O 1 vjVjovjv_ 1 O ltt.v_ 1 v_v_ 1 OttV_Avj loo lottOO 


12 69 


Qy 


1261 


AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 


1320 






II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 II 1 II 1 1 1 II 1 I 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1270 


AttAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 


1329 


Qy 


1321 


AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 


1380 






1 1 1 11 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 II 1 1! 1 1 1 1 1 




Db 


1330 


AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 


1389 


Qy 


1381 


CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 


1440 



MM I Mill III II II IIMIMIIIIIIIIIIIIIIIIMIIIII I Mill Mill 



nb 


i ion 

± j y u 


fAPA APPAPTTPTPPPTPPPP ?\TA ATTP A TP A PP A P A TTTTTP A rip t\ t\ rpr-i 7\ tv tv omnpn 

L ALj AAGG AG 1 1 L 1 L LG 1 GGGL A 1 1 1 G A I bAL GALA X 1 1 1 1 GAGG AGG A I G AAL AC 1 1 C 


1 A A Q 

144 9 


Qy 


1441 


TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 


1500 






1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II II II 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 / Cft 
J. *± 3 U 


TTTPTfi A PPTTP A P P A A TPTPPPPPSTRPR /"•J" 1 A PP APPAPPPAP APPAPPPPA TPPPTiPP A 


15 oy 


Qy 


1501 


GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 


1560 






MINI MM M HUM II . II II Mil IL Ml III 




Db 


IjIU 


PIP 1 A ATRTTPS A PAPTPTTPPP 1 1 " 1 'P PPTTT'Pr'P'T'PT'Pr'T 1 a PPPTPPPpfTTrifprtmnnnrnv /"tn 
btAAlAl 1LAALAG1L1 1CCC i 1 GLL i LGGGL IG 1 CC 1 AGClTCllCTTGTGTGGCCACA 


1569 


Qy 


1561 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1620 






MINI III MM IIMMIII MM MM III III IMMIII Ml 




Db 


1570 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1629 


Qy 


1621 


GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 


1680 






MMMM MM III III MMMMMIMM MMMM MMMM 




Db 


-L D J U 


PITPAPTP.APIA P.T A TTPPTPTT ATPP APPTPA Tvppppmpmpppp A PA TPAPPTV'POP/'iPPrim 

Lr 1 L.tt\j 1 (jAbAblAl 1 LtG lol 1 Al GGAGG 1 LAftbb 1 1 C 1 btbbALA 1 LAGG I GLLLGGGGT 


1689 


Qy 


1681 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


1740 




ID 


i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 i 1 1 

APAPJTPATPPTPPPPTTTA PIP 1 APAPTAPA APPPA papppa a /^r*r* r nr*r*r*r^r^rnf m ' a pp a prTimm 
/\LALr 1 LAlbb 1LLLLI 1 1 AbbALAb 1 AbAAbbbALAbbLAAbbb 1 GGLGG 1 bAbbALi 1 X 


174 9 


Qy 


1741 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGaAACTGTGAAAACCATAAGGGTT 


1800 






1 1 1 Ml 1 1 1 II 1 II 1 II 1 llllll II 1 1 1 1 1 1 1 M 1 1 1 1 1 Ml 1 1 1 1 1 Mill 1 1 1 MM 




nb 


i Ten 

1 / DU 


P A APAPAPATA TT^fif^r* A piprppp A A TTP A A P A A TP A TP AAA pmpmp a A A 7\ PP7V m7V 7\ nnnrrirn 

vjAAlaAL.AU A 1 A 1 GGGGAG 1 1 GGAA 1 1 LAAbAA I GA I GAAAL I GTGAAAAlCATAAGGGT T 


1809 


Qy 


1801 


aAAATAGTAGATGAGGAGGAATACGaAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 






MM III MMMM III III III MMMM MM III MM 




nb 


i qi n 

lolU 


A A A A TAPTAPATPAPP APPAATA PP AAA PPPA AP AP A A rprnmnrpmn rpmpppPTrrPPTP A A - 

AAAA 1 Ab 1 AbA 1 bAbbAbbAA 1 ALbAAAbbL AAbAbAA 1 1 1C1 1CA1 1GLLLI 1GGTGAA 


1869 


Qy 


1861 


CCGAAATGGATGGAACGTGGAATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 


1920 






II 1 1 II II 1 II 1 M II MM 1 Ml 1 II 1 1 II II 1 1 II 1 1 II II 1 II II II II 1 II III 1 




nb 


JL O / U 


PPPA A A TP P A TP P A A r*(**TT*r* A A T A TP A P A TP TP APAPAPAPPA A PPTP 7\ /Tn 7\ mo/^« 71 tv /-i 7v t\ 

LLGAAA 1 LtGA 1 GGAALG 1 GGAA I A 1 LAGA 1 G I GALAGALAGGAAGC T GAlTATGGaAGAA 


1929 


Qy 


1921 


GAGGAGGC C AAGAGGATAGC AGAGATGGGAAAGC CAGTATTGGGTGAACACC C C AAACTA 


1980 






MM M MMMM MMMM MMMIMMMMMMMMMMMMMMMI 




nb 

USD 




bAbbAbbLLAAbAbbA I AGCAGAGATGGGaAAGCCAGTATTGGGTGAACACCCCAAACTA 


1989 


Qy 


1981 


GAAGTCATCATTGaAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 


2040 






Ml MIMMII IIMMIII III MM III MM illMMII Ml 




nb 


i qon 


P A APTPA TP A TTP A A P APTPPTA TP A PTTPA A P A PTA nPnTinn 7\ pa A A PmP 7\ m/^i 7\ t\ riT\ tv/~» 

bAAb 1 LA 1 LA 1 1 bAAbAb 1 LL 1 A 1 GAG 1 1 L AAGAC 1 ALGGTGGAlAAACTGATC AAGAAG 


2049 


Qy 


2041 


ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 


2100 






MM Ml MMMM III MMMM Mill III! MM MMMM 




nb 
usd 


o a c n 


ALAAACC 1 GGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 


2109 


Qy 


2101 


ACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCC 


2160 






MM MMMM Ml MIMMII Ml MM MIMMII MM III 




nb 

USD 


nil ft 
Z X J. U 


A f^(^f^ r T l ( m% A P TP P A P A P P P Z™ 1 7\ TP APPATPAPPA nip a A mz-i /^i /-» t\ /^i j^i 7\ t\ r*\/~* /~t m/™i /~i a^i /"tmyi /"*» 

ALLG 1 LAG 1 GLAGLAGGGGAIGAGGATGAGGATGAATClGGGGAGGAGAGGCTGCCCTCC 


2169 


vy 


9 "1 £ 1 
Z _L O X 


TPPTTTP A PT A PPTP A TP P A PTTPPTP A PTPTPTTPTPP A A PPTPPTPTTTP PnmnnnprTW 

IvjtLI i ILtAL 1ALG 1LA1GLAL I ILL I GAL 1G1L 1 IL 1GGAAGGTGCTGTTTGCCTGTGTG 


2220 






MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 




Db 


2170 


TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 


2229 


Qy 


2221 


CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 


2280 






MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 




Db 


2230 


CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 


2289 



ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 234 0 

III II I II I I I I I I I I II II I I I I I Ml II III II II I I II I I II I I I I I III I I I I I I 
ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 234 9 

GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 24 00 

I MM 1 1 II 1 1 1 II I ! I II 1 1 1 1 1 1 1 II II I MM 1 1 1 1 1 1 1 1 II I II 1 1 1 1 1 1 1! II I 

GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 2409 
AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 2460 

1 1 1 Ml II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 M 1 1 1 

AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 2469 
AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 252 0 

1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 Ml MINI II I II 1 1 1 1 1 1 1 III II MM M 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 252 9 
TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2580 

1 1 1 1 1 M i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 j 1 1 i 1 1 i 1 1 1 1 1 1 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2589 
CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 2640 

1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 2649 
CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 2 7 00 

1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 2709 
AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2760 

I i 1 1 1 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 

AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2769 



II 



RESULT 7 
ADH42306 

ID ADH42306 standard; DNA; 2778 BP. 
XX 

AC ADH42306; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Novel human nucleic acid NOV55c. 
XX 

KW ds; gene; cardiovascular; antiarteriosclerotic ; hypotensive; cytostatic; 

KW anorectic; antidiabetic; immunosuppressive; anti-HIV; neuroprotective; 

KW nootropic; antiparkinsonian; antiasthmatic; antiinf ertility; 

KW cardiomyopathy; atherosclerosis; hypertension; cancer; obesity; diabetes; 

KW AIDS; multiple sclerosis; graf t-versus-host disease; Alzheimer's disease; 

KW Parkinson's disease; asthma; fertility disorder; chromosome mapping; 

KW tissue typing; preventive medicine; pharmacogenomic; vaccine. 
XX 

OS Homo sapiens. 



Qy 


2281 


Db 


2290 


Qy 


2341 


Db 


2350 


Qy 


2401 


Db 


2410 


Qy 


2461 


Db 


2470 


Qy 


2521 


Db 


2530 


Qy 


2581 


Db 


2590 


Qy 


2641 


Db 


2650 


Qy 


2701 


Db 


2710 


Qy 


2761 


Db 


2770 



YY 
AA 








DM 


WO200310215 


YY 

AA 








pn 


11 


-DEC- 


2003 


AA 








PP 
tr F 


04 


-JUN- 


2003 


YY 
AA 








DD 
FK 


04 


- JUN- 


2002 


DD 

irK 


04 


-JUN- 


2002 


FK 


04 




POO? 

Z. \J \J Z 


DD 


05 


- JUN- 


2002 


DD 

irK 


06 


-JUN- 


2002 


DD 

irK 


06 


-JUN- 


2002 


DD 
rK 


06 


- JUN- 


2002 


DD 

irK 


06 


-JUN- 


2002 


DD 


06 


- JUN- 


2002 


DD 
irK 


06 


- JUN- 


2002 


DD 

irK 


07 


- JUN- 


2002 


DD 

irK 


07 


- JUN- 


2002 


DD 


07 


- JUN- 


2002 


DD 

FK 


07 


- JUN- 


2002 


DD 
FK 


07 


- JUN- 


2002 


DD 
FK 


07 


- JUN- 


2 002 


DD 
FK 


10 


- JUN- 


2002 


DD 
FK 


10 


-JUN- 


2002 


DD 
FK 


10 


- JUN- 


2002 


DD 
FK 


11 


- JUTJ- 


2002 


DD 
FK 


11 


-JTJN- 


2 002 


DD 
FK 


11 


- JUN- 


2 002 


DD 
FK 


11 




9 009 

Z VJ VJ z 


DD 
FK 


1 1 

X X 


-iTTTM- 


9 009 
z u vj z 


FK 


11 


-JUN- 


2 0 02 


FK 


12 


— iTTTKT- 


9 009 

Z U \J z 


DD 

FK 


12 


- iTT TM - 


9 0 09 
z. u u z 


"DTD 
FK 


12 


- JUN- 


2 002 


FK 


12 


- JUN- 


9 0 09 

Z VJ VJ z 


DD 
FK 


12 


- JUN- 


2 002 


DD 
FK 


12 


- JUN- 


2 002 


DD 

IrK 


13 


- JUN- 


2 002 


DD 
FK 


14 


- .TTTN - 


9 009 

Z \J VJ z 


DD 

irK 


14 


-iTTTN- 


9009 


DD 
FK 


17 




9 009 

Z VJ u z 


DD 
FK 


1 ft 

J. o 


- ■TTTW- 


9 0 09 
z u u z 


FK 


1 ft 


U U1N 


9 0 09 

Z VJ VJ z 


PR 


1 Q 

_L -7 


- iTTTKT - 


9 0 09 
z u u z 


DTD 
FK 


1 Q 


~ iTTTNT - 


9 0 09 


FK 


19 


- JUN - 


9 0 09 

Z V W z 


FK 


9 R 
Z, D 


U U1M 


9 0 09 
Z U U Z 


DD 
FK 


Ub 


-AUb - 


ZVVZ 


FK 


09 


-AUG- 


2002 


PR 


12 


-AUG- 


2002 


PR 


13 


-AUG- 


2002 


PR 


15 


-AUG- 


2002 


PR 


15 


-AUG- 


2002 


PR 


26 


-AUG- 


2002 


PR 


12 


-SEP- 


2002 


PR 


13 


-SEP- 


2002 



-A2 . 



2003WO-US017573. 

2002US-0385490P. 
2002US-0385615P. 
2002US-0385755P. 
2002US-0386041P. 
2002US-0386355P. 
2002US-0386357P. 
2002US-0386447P. 
2002US-0386459P. 
2002US-0386465P. 
2002US-0386864P. 
2002US-0386701P. 
2002US-0386796P. 
2002US-0386931P. 
2002US-0387078P. 
2002US-0387081P. 
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2002US-0387668P. 
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2002US-0390006P. 
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2002US-0391726P. 
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2002US-0403732P. 
2002US-0406182P. 
2002US-0410085P. 
2002US-0410505P. 



PR 23-SEP-2002; 2002US-0412 955P . 

PR 30-SEP-2002; 2002US-04 15195P . 

PR 23-OCT-2002; 2002US-0420627P . 

PR 23-OCT-2002; 2002US-0420718P. 

PR 24-OCT-2002; 2002US -0420852P . 

PR 31-OCT-2002; 2 002US-0422750P . 

PR 01-NOV-2002; 2002US-0423095P. 

PR 05-NOV-2002; 2002US-0423748P. 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Alsobrook JP, Anderson DW, Baumgartner JC, Berghs C, Boldog FL; 

PI Burgess CE, Casman SJ, Catterton E, Dhanabal M, Edinger SR; 

PI Ellerman K, Ettenberg S, Gangolli EA, Gerlach VL, Gorman L; 

PI Grosse WM, Gunther E, Guo X, Gusev VY, Herrmann JL, Ji W, Kekuda R; 

PI Khramtsov NV, Larochelle WJ, Li L, Liang H, Low K, Macdougall JR; 

PI Maclachlan T, Malyankar UM, Mcqueeney K, Mezick AJ, Miller CE; 

PI Millet I, Padigaru M, Patturajan M, Peyman JA, Qian X, Rastelli L; 

PI Rieger DK, Rothenberg ME, Shenoy SG, Shimkets RA, Smithson G; 

PI Spytek KA, Stone DJ, Sukumaran S, Szekeres ES, Vernet CAM, Voss EZ; 

PI Wolenc AR, Zhong M, Zhong H; 

XX 

DR WPI; 2004-053467/05. 

DR P-PSDB; ADH42307. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, cardiomyopathy, 

PT atherosclerosis or diabetes, in chromosome mapping, tissue typing or in 

PT pharmacogenomi c s . 

XX 

PS Claim 20; SEQ ID NO 859; 1503pp; English. 
XX 

CC The invention relates to 566 new isolated human polypeptides and their 

CC encoding genes, sequences that are at least 95% identical to these or 

CC sequences comprising one or more conservative substitutions in these. The 

CC polypeptide, polynucleotide and antibodies against the polypeptides are 

CC useful in diagnosing, treating or preventing NOVX-associated disorders, 

CC e.g. cardiomyopathy, atherosclerosis, hypertension, cancer, obesity, 

CC diabetes, AIDS, multiple sclerosis, graf t-versus-host disease, 

CC Alzheimer's disease, Parkinson's disease, asthma, or fertility disorders. 

CC The nucleic acids are further used as hybridization probes, in chromosome 

CC mapping, tissue typing", preventive medicine, and pharmacogenomics . The 

CC polypeptides are also useful as vaccines. This sequence represents an 

CC example of the nucleic acid sequence of the invention. 

XX 

SQ Sequence 2778 BP; 656 A; 684 C; 764 G; 674 T; 0 U; 0 Other; 



Query Match 99.9%; Score 2763; DB 12; Length 2778; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2763; Conservative 0; Mismatches 0; Indels 0; Gaps C 
Qy 1 ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

1 1 1 1 1 II II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

Db 10 ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 69 



Qy 61 TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 


70 


TTTGTGCTCTTCCTGAATGGTCTTCGA 


1^ > 


Qy 


121 


ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 


180 






IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


130 


ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATPOnArTnrAAnnAnnnTRTPATPPTn 


1 

X O -7 


Qy 


181 


CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 


240 






II MM MMIMIMM IIIMMII MIMIMMMM III III II 




Db 


190 


CCAATCTGGTACrcGGAGAArroTTPPPTTnnnriAPAAnATTrjprAriririTrATTnTPTaT 


O A Q 

<6 ft 27 


Qy 


241 


TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 


300 






II MMMIMMM III MIMIMIMMMIMMM lllllll Ml 




Db 


250 


TTTGTGGCCCTGATATACATGTTGGTTGRRCJTnTrrATPATTnrTnArrrJPTTPATnnPA 


7 fi Q 


Qy 


301 


TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 


360 






II MIMMM III MM MM IIIMM IIMIMI IMIIII II 




Db 


310 


TCTATTGAAGTCATCACCTCTPAAGAGAGGGAnnTnAPAATTAAfiAAAPPPAATPPAna A 


J D -7 


Qy 


361 


ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 


420 






IIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 




Db 


370 


ACCAGCACAACCACTATTPGGnTPTf?nAATnAAAPTPTPTPPA APPTHAPPPTTATPOPP 


4 0 Q 


Qy 


421 


CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 


480 






IIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


430 


CTGGGTTCCTPTGPTPPTGAGATAPTPPTPTPTTTAATTnAnfJTnTPTrJflTPATnnnTTP 


4, q q 


Qy 


481 


ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 


540 






1 1 i 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 




Db 


490 


ATTGPTGGTGATPTnGGAPPTTPTAPPATTnTAf4POAPTnPAPPPTTPAAPATnTTPATP 




Qy 


541 


ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 


600 






1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M M 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 




Db 


550 


ATCATTGGPATPTGTGTPTAPGTGATPPPAPAPnPAPAPAPTPnPAAPATPAArtPATPTA 




Qy 


601 


CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


660 






MM Ml IM MMMMIMMMMMMMMMIMM MIMIII II 




Db 


610 


CGAGTCTTCTTCATCACCGPTGCTTGGAGTATPTTTnPPTAPATPTOnPTPTATATPATT 


O D -7 


Qy 


661 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 


720 






II illl MMMIMMM Ml MMIMMIIMI MMMM MUM 




Db 


670 


V— X VJ\JV»„tt\J X V_ X X»wXV_v_V_V_X \J\J X VJ X uu x LLhyu XXX uuuMiibuL^ X V_ 1 V^M.v— 1L1L1 1L 1 


/ Z y 


Qy 


721 


TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 


780 






MM MM III Ml III MM III MMMIMMM IIIMIIMI 




Db 


7 "3 0 


TTTPPAPTPTPTPTPPTTPTPPPPTr4nir!Tnnr , ar2i\ r r2i z\ arpaPTPPTPTTPTarn a at'a r* 


/yy 


Qy 


781 


ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 


840 






IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 




Db 


790 


ATGCACAAAAAGTAPPGPAPAnAPAAAPAPPnAnnAATTATPATAOAnAPAr5Af:r;r5TnAP 


Oft 27 


Ov 


841 


PAPPPTAAGGGPATTGAGATGnATnnnAAAATnATnAATTPPPATTTTPTAnATPPPAZVr' 

v— /A^v_-v - . x /^r\\j\j\j \^c\ 1. i UrtUn 1 UVJrt 1 uVJUnnnri 1 un 1 VJnn 1 ILLtrtl X X X \* X rtvjrt. X \jVjVj/-\rt.v» 


27 U U 






II 1 1 III 1 1 II 1 1 1 1 1 1 1 Ml 1 1 1 1 II 1 MINI 1 1 1 II 1 III 1 1 II 1 M III 1 II III 1 




Db 


850 


CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 


909 


Qy 


901 


CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 


960 






II II 1 1 1 II 1 1 II 1 1 1 II 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 II 1 1 II 1 Ml 1 Ml II 1 




Db 


910 


CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 


969 



Qy 


961 


AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 


1020 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 I 1 1 1 ] 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 i 1 1 1 1 
1 1 M II II 1 II M M 1 II 1 1 M M 1 II II II II II II 1 1 II 1 1 II II II 1 1 1 II 1 1 II II 




Db 


970 


AAPPATPTPAAPPAAAAAPAPPPAPAPAAPPAPTTAPATPAPPTPPTPPAPATPPPPA AT 


x u £ y 


Qy 


1021 


TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 


1080 






1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 II 1 1 1 II 1 1 1 1 1 1 II 1 1 II 1 II 1 1 II 1 II II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1030 


TAPTATPPTPTTTPPPAPPA APAPA APAPPPPPPPPTTPT7APPr!TZVTPPZiZinPP7APTr , r , T 


i noo 

1U07 


Qy 


1081 


ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 


1140 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I I | I | | I I I I I I I | | | | | | | | || | | 




Db 


1090 


ATGATGACTGGTGCAGGPAATATppTPAAGAAAPATPPAPPAPAAPAAPPPAAPAAPPPP 


1 1 4. Q 


Qy 


1141 


TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 


1200 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l l l l I I I I I I 
1 1 1 1 1 1 1 1 II 1 1 M 1 II 1 1 1 1 1 II II 1 II II 1 II II 1 II 1 1 II 1 1 1 II 1 1 II II 1 1 1 1 1 1 




Db 


1150 


TPPAPPATPappp APPTPPAPAPPPATP7APPPTn7AP.ri7APTTTzv.TTTr i r , A sppTPTTfTTT 1 


i o r\ q 


Qy 


1201 


GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 


1260 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I I I I I I I I I i 
1 1 M 1 1 M : 1 i 1 1 1 i 1 1 1 1 1 II II II II 1 1 1 1 II 1 1 II 1 1 1 1 II 1 1 1 II 1 1 1 II 1 II 1 1 




Db 


12 10 


GAPPPATPTTPTTAPPAPTPPPTPPAPA aptptppppptpttxPtpptptaptaptpptptapp 


x/, o y 


Qy 


1261 


AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 


1320 






II 1 1 1 1 i 1 1 1 1 1 1 I 1 I I I I I I I I I 1 1 1 1 1 1 1 1 i i i i i i t i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 M 1 II 1 II 1 II 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 




Db 


1270 


rirtrt.uuuuurtvjrtL,rt lul rt/lrturt.L ^rt X \j X ±\ X \j X uunL X HLHHrtALHIjnooA 1 1 1L1 bLL 


i y 


Qy 


1321 


AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 


1380 






1 1 1 1 1 1 1 i 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 I I I I I I I I I I I I I t I I I I I i I I I I i i i 

1 M 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 II 1 II II 1 1 1 II II 1 II I I I I I I I I I I | | I I | | | | | | | 




Db 


1330 


A ATPPAPPPPPTP aptatpzaPttptapzaptapppptappptppttptpta BprrnnnAPAPApp 


i j o y 


Qy 


1381 


CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 


1440 






i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 I 1 I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 t i 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 II II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II M 




Db 


13 90 


PAPAAPPAPTTPTPPPTPPPPATA A TTP A TP A PP IV P7ATTTTTP2\PP7APP7A TP 7A ftPftPTTP 
v^ftUArtuvjAVj x x v_ x LLu x uuuUH x .M-rt. X X X uHLunLH X X X X X uHuonbuH 1 VjAALAL 11L 


T A A Q 

144 y 


Qy 


1441 


TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 


1500 






1 1 1 l l l i l l l i 1 l l i I I I I I I I I i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 




Db 


1450 


TTTPTAAPPTTPAPPA ATPTPPPP AT AP7APP7APP7APP 7A ppp i\ni\ pp i\ nnnn a Tnnnn^nn a 


1 C A Q 

±d uy 


Qy 


1501 


GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 


1560 






Ml I M 1 1 II 1 1 1 1 II 1 II MINIMI III 1 1 1 1 1 II II 1 1 II MM 1 II 1 II II 1 1 II 




Db 


1510 


GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 


1569 


Qy 


1561 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1620 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l l l l l l I I I I I I I I i i i i i i i i i i i i i i i i i i i i i i i i i i i 

M M 1 1 1 II 1 1 II 1 1 1 II II II II 1 II II 1 M II 1 II 1 1 II 1 II 1 1 1 1 1 II II M 1 1 1 1 1 




Db 




PTTZi PP 7A TP TTPP L\ TP 2\ TP APPA TP PflPPPATPT'TPA r'TTTTT 1 7V A r rPTP7\T7\PT7\T"Tr'7\T> 

x inLUhiL x 1 bbH 1 bii 1 bHLLH 1 bLHbbLH 1 L 1 1LAL 111 1 oAA x o 1 \j A 1 AL. 1A1 1 LAI 


162 9 


Qy 


1621 


GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 
1 1 1 1 1 1 i 1 1 i i I I I I I I I I I I I I 1 i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 


1680 


Db 


X O «J \J 


M 1 1 1 1 1 1 1 II 1 1 II II 1 1 1 1 1 1 1 1 II II M II I || || 1 | | | || | | || | || | | | | | | | | | 

PTP A PTP 7a P7APT7ATTP ptpttta tp p a ppt pa 'Ar , r % T tr vr tr pr , r , r t r t apatptn npmrip / nnr , nnn'Ti 
blLAblbHbHblAl XooxLjx 1 A 1 oVjALjLj 1 L AALjVj 1 1 L 1 vjLooALAx LAvjvj 1 vjCLLCjVjGGT 


1689 


Qy 


1681 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


1740 






Ml Ml 1 M 1 1 1 1 1 II 1 MIMI II 1 M 1 1 1 M 1 1 1 M 1 1 M 1 M M M 1 1 M 1 1 1 1 1 II 




Db 


1690 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


1749 


Qy 


1741 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 


1800 






1 II 1 1 M 1 1 1 II 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 M II 1 M II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1750 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 


1809 



Qy 


1801 


AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 






1 IMMIMI II! MM MM MM III II |||| III Ml III 




LJU 


A. O ± U 


A A A AfTVPT'AP A TP 7\pp7\ PP A A T 1 A P 1 P A A7\PPP7\ 7\P7\P)\ 7\ rprnmrirnmri A rni imnn/i'liinnnnr'/'i 7V 7\ 

AAAA 1 AC 1 AC A 1 CACCACCAA 1 ACCAAACCCAACAGAATTTCTTCATTGCCCTTGGTGAA 


i O /T ft 

1869 


Qy 


1861 


CCGAAATGGATGGAACGTGGAATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 


1920 






MM 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 II 1 II II 1 1 Ml 1 Ml 1 II II 1 1 Ml 1 1 II 1 1 II 




Db 


1 R70 
iO / u 


c*cc± z\ Zi zxTTir! z\ Tnnzi a r'nnrr^n z\zi r r i ziT i p , zif" , z\ Terr* apapapappa APPTPAPTAfrppA a p a a 


192 9 


Qy 


1921 


GAGGAGGC C AAGAGGATAGC AGAGATGGGAAAGCCAGTATTGGGTGAACAC C CCAAACTA 


1980 






M II II II 1 1 1 II 1 1 1 1 II 1 1 III 1 II 1 1 1 II 1 1 1 M 1 1 III 1 1 1 II 1 1 III 1 1 1 M 1 1 1 








n 2\ A fff r* H. APAPPATAPPAPAPA TPPPA A APPPAPTA r nT , /^ , /~ , /^T»0 A APAPPnns A 7v Pm7v 

uAubHbutLMvjAbuA 1 AbL, ACACA 1 bbbAAAbLLAu 1 A 1 1 CCC 1 CAAC AC C C CAAACTA 


1989 


Qy 


1981 


GAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 


2040 






1 III MM! 1 MM 1 MM 1 llllllll 1 MM 1 Mill Ml! MM Mill! III! 1 II 




LJU 




p A APTPATP A r T" lr PP A APAPTPPTA T^P 7* /-» rpr-p/"-* 7\ 7\ p 7\ pen 71 o/™</-irn/-i/^i 7\ f"i 7\ 7\ 7\ nrrn 7\mri7\ 7V/^itv t\ 

laAAC 1 tA 1 IA 1 1 CAACAC 1 CC 1 A 1 CAC 1 1 CAACAC TACGGTGGACAAACTGATCAAGAAG 


2049 


Qy 


2041 


ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 


2100 






Ml 1 1 IIIMI II Mill! Ill MM! IIIIIMIIII MM MM MIMMIIM 1 




LJU 


U D U 


A r 1 Zi A a r*r*^r t r t r f r*'V'vr i r , 'T tr pr tr pr*r t r t r* a ppp a ttppt^pp appp a pp a prpmp7.fn/«in7v nnnnn m/"i 
ALAAAtL 1LxLjL~L. 1 lljljl iblbuuuALLLAl 1 CC i CCACCCACCAC I 1 CArCGAGGCCATC 


2109 


Qy 


2101 


ACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCC 


2160 






MIMIIIIIIIIIIIIIIIIIIIIMIIMIIIIIMIIIIIIIIIIIIIIIIIIIIII 




LJU 




APPPTPAPTPPAPPAPPPPA rnp a PP A ""PP A PP ATPA 7\ ^n/^*r^r*r*r^r^ 7v /^i /~i 7\ /17\ PP /"irn/"i/"i/*i/"irn/~</''i 

AUCC 1 LAb 1 CCACCACCCCA 1 bAbbA 1 CACCA 1 CAA1 CCGGGGAGGAGAGGCTGCCCTCC 


2169 


Qy 


2161 


TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 


2220 






1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 




LJU 


z 1 / u 


•"PP 1 P lr P r P r TP 1 A PT 1 A PP T^P A T'PP A PTTPPTP A prnprppmmpmpp a uppmpPTTi rprprp/--i ^ /~irn/~i rp/-i rn/~i 

1vjC1 1 IbAL 1ACC1CA1CCAC 1 ICC i CAC 1C 1 C 1 1 C 1CCAACC ICC 1 GTTTGCCTGTGTG 


2229 


Qy 


2221 


CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 


2280 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




uu 


noon 
z z j u 


PPPPPPAPAPAPTA PTP PP A P^OP 1 P tr PP , P 1 P'P 1 P"T , P i P i, T l, T , P 1 / r ^ / r ^/~ 1 / r ^T l /~"T'/^/^7\ rppPTTiA rpnA rnmorm 

^ ^ LbLLAL AVj AC 1 AC 1 bLLALbbb 1 CCCCC ICC 1 1 CCCCC 1 C 1 CCA 1 CC I CATCATTGGC 


2289 


Qy 


2281 


ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 


2340 






1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




uu 


Z Z u 


A T'P 1 PTP A P PP P P A TP A TTPPPP A P'P^T'P'P' P^P^T'P^P' P 7V /"""T"""P/""t/~i/'"i /"im/""! /™i 7\ /~i /~i t\ rnrn/^i /*i m/~irn/^» 7\ tv tv 

A ICC 1CACCCCCA1CA1 1CCCCACC ICCCC 1 CCCACTTCGGCTGCACCATTGGTCTCAAA 


2349 


Qy 


2341 


GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 


2400 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 




JJJJ 


0 *5 c n 

Z J DU 


P AHPTPAPTPA PAP PTPTTPTTTTPP<"PPP P 7\ fTirprp/^n/*! 7\ riOTinrrnmnnn 7v /""i 7v rn 7v rt/immm/i/s/i 

bAl 1 CAC I CACACC 1 C 1 1C1 1 1 lCCiCCCATTTGGCACCTCTGTCCCAGATACGTTTGCC 


2409 


Qy 


2401 


AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 


2460 






1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




UU 


Z ft ± u 


APPA A A P 1 PTP PTP P P PTP P APPA mprpi\ rp a T'PP 7\ p 7\ pppprripp7v rnrp/^i t~\ 7v 7\ m/^i 7\ r~tr~t t~\ r*t r~\ 

AO u AAAb C1CC1CCCC1C C ACCA i C 1 A 1 A 1 C CAC AC G C C T C C ATTGG C AAC GTG ACGGG C 


2469 


Qy 


2461 


AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 


2520 






1 II 1 1 1 1! 1 1 1 1 1 II II II 1 II MM! 1 II III II 1! 1 1 II 1 1 1 1 1 II 1 II 1 II 1! 1 1 




jjjj 


Z 4 / u 


APPA A C*C* r^C^C^T^r^T^. A r PP r PPT"T'PP'T»PPPP7\ iTinnrinP'rn/'iriorTi/'infn/^ririrTiri/i/iririrtnTi m^irmv /~i 

ACCAACCCCC 1 CAA IC1C1ICC1 CCCCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 


2529 


Qy 


2521 


TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 


2580 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 




UU 




TnnflPTPTPlP APPP a PAPP APTTPPA PPTP r T 1 PPPPPPPPAPAP r PPPPPT ir rr ,r ropr"fTinA pp 
HaVjoU 1^ 1 O^A^jLjLjAL-ALjCAC I I CCACC lb 1 CCCCCCCCACAC ICCCC 1 1C1CCCTCACC 


2589 


Qy 


2581 


CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 


2640 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 M 1 1 1 




Db 


2590 


CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 


2649 


Qy 


2641 


CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 


2700 



II 1 1 II III II I II III I II IIIIMI Mill MM 1 1 II Mill I II II llllllll 1 1 

Db 2 650 CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 2 709 

Qy 2701 AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2 760 

1 1 II 1 1 III 1 1 1 II MM II I II I II MM II II II 1 1 II II III II I II II IMIIMI 

Db 2710 AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2 769 

Qy 2761 TTC 2763 

Ml 

Db 2770 TTC 2772 



RESULT 8 
ABN83428 

ID ABN83428 standard; cDNA; 2782 BP. 
XX 

AC ABN83428; 
XX 

DT 21-AUG-2002 (first entry) 
XX 

DE Human transporter protein coding sequence. 
XX 

KW Human; sodium/calcium exchanger; transporter; brain; heart; kidney; lung; 

KW spleen; testis; leukocyte; foetal brain; chromosome 14; gene; ss. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 
FT CDS 10. .2775 

FT /*tag= a 

FT /product= "Human transporter" 

XX 

PN WO200233086-A2 . 
XX 

PD 25-APR-2002. 
XX 

PF 17-OCT-2001; 2 00 1WO-US032 152 . 
XX 

PR 17-OCT-2000; 2000US-0240836P . 
PR 13-MAR-2001; 2 001US - 00804474 . 
XX 

PA (PEKE ) PE CORP NY. 
XX 

PI Merkulov GV, Ketchum KA, Shao W, Yan C, Di Francesco V; 

PI Beasley EM; 

XX 

DR WPI; 2002-479677/51. 
DR P-PSDB; ABB83246. 
XX 

PT Human transporter peptide related to sodium/calcium exchanger subfamily 
PT for identifying modulators useful for treating a disease or condition 
PT mediated by human transporter protein. 
XX 

PS Claim 4; Fig 1; 200pp; English. 
XX 

CC The present sequence is the coding sequence of a human transporter 
CC protein, which is related to the sodium/calcium exchanger subfamily. 



CC Experimental data indicates expression of the transporter gene in humans 

CC in brain, heart, kidney, lung, spleen, testis, leukocyte and foetal 

CC brain. The gene of the transporter was mapped to chromosome 14 by ePCR 
XX 

SQ Sequence 2782 BP; 655 A; 685 C; 766 G; 676 T; 0 U; 0 Other; 



Query Match 99.8%; Score 2761.2; DB 6; Length 2782; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2763; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 



Ov 


l 


ATGGCGTGGTTAAGGTTGCAnPOTPTrArrTOTfirrTTrPTPPATTTTnnrJPTnnTTZir'P 

x uu\> vj x x x rinvjvj x x vj^nvj^v J. v_» x LnvL> 1L1 U\_\_ l l^L l V_/A x X X X vjUUv^ x uu x X \— 


o u 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 N 1 II 1 1 1 1 M II 1 1 II II II II II II II I 1 II 1 II 1 I II 1 1 II 1 1 1 II II II II 1 I 




Db 


10 


ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 


69 


Ov 

wy 


O J. 


TTTPTPtPTPTTPPTPA atpptpttppappapappptpptppptpapppp apptpppa app 

1 1 lul 1U1 1 V— V~ x \Jr\t\ X Uu iL.ll LvjM\jLMVjM\joL x 1 uuL 1 LAVjVjvjvjALw X VjLLAA\jL 


ion 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 II 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 II 1 | M 1 1 1 1 II 1 1 I I 1 II 1 1 




Db 


70 


TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 


129 


Ov 


121 


A P A p*p»PP A a A A p A A th a ptp p thttp a cxctcwn a Tpr^n a ptp pa app appptptpatpptp 

rtU/iVjuuLKurtrtV-rtH 1 unu iLLlul l LHuuu 1 LA X LovjAL 1 VaLAAovxAookj lul LA ILL 1 Vjf 


lb u 






1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


130 


ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 


189 


Ov 
vy 


-L O X 


P P A A T ptpp»t zv P p p an A a A a p p p t t p p p ttpppp a p a a p a t tp p p a p p p t p ATTPTprrAT 

\_L.rtr\X L X OvJ X nUv-.v_.uunur\HLLL 1 1L.LL1 1 uuvjUnLnnuA 1 1 vjLLAvjvjL* 1 LAI lulL Inl 


O A Pi 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I I I M 1 II 




Db 


190 


CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 


249 


Ov 

vy 


94. 1 


x x 1 \j x uvjLUL luHiHIHLHIu 1 ILL 1 1 buuulul LLAJL LAI IbLluALLbL 1 1 LAIVjVjLA 


*5 n n 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II II II II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I 1 I I I M 1 




Db 


250 


TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 


309 


Ov 

vy 




TPTATTOA AOTP ATP APPTPTPA AP AP APPP APPTP APA ATT A APA A A PPPA BTPPAPA A 
x L 1 ±\ 1 X \jf\r\\j x LA 1 LALL 1 L 1 Lnnbfiu/iuuunbo 1 vjAL AA 1 1 /ittunAALLLHA X ubnoAA 


"3 C f\ 






I 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

II M 1 II 11 1 II II II 1 II || I || || || I I I | I | I | | | | | | | | | | || | | | | | | || | | | | | 




Db 


310 


TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 


369 


Ov 

vy 


JDl 


APP APP A P A A PP A PTATTPPPPTPTPP A A TV AAA r ,r Pr"T , r %r TT*r t 7\ A f" 1 /" ,, TV A /" i r*/"'"T""P A TT"* r~* 

HLLHbLHLHALLAL 1A1 1 LL*L»L* x L IboAAloA/iAL luiLI LLAALL ILrALLL 1 1 Al bbLL 


4 2 U 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


370 


ACCAGCACAACCACTATTCGGGTCTGGaATGAAACTGTCTCCAACCTGACCCTTATGGCC 


429 


Ov 

vy 


AO 1 


PTPPPTTPPTPTPPTPPTPAPATAPTPPTPfTPTTTA A mmp -n /^^rp^rp^rp/-»/-irp^i t\ rp^-i /-ly'-i rnrn /"i 

tluuul ILL 1L 1 vjL 1 LL 1L\AL\A1 AL ILL 1L 1L 1 1 1 AAI 1 VjAvjL* 1 vj 1 vjr 1 KjKj 1 L A 1 oLnj 1 J.L 


4 80 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I 1 1 

1 1 1 1 1 M II 1 1 II 1 1 1 1 II M II 1 1 II 1 II II 1 1 II 1 1 II 1 1 1 1 1 1 1 II 1 II 1 1 II II II 






*t J V 


Llbbbl ILLlLlLxLILLlVaALiAlALlLLlLlLl 1 lAAi 1 LrACjCjTGTGTGGTCATGGG T TC 


489 


Qy 


481 


ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCaACATGTTCATC 


540 






II 1 1 1 II 1 1 Ml 1 II II 1 II 1 1! II MINI II III 1 1 1 1 1 MM 1 1 II 1 1 1 1 II 1 Ml 1 




Db 


490 


ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 


549 


Qy. 


541 


ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 


600 






M M 1 M 1 1 II 1 M M 1 1 1 1 M 1 1 1 1 M 1 1 M M 1 II 1 1 1 M M M M 1 1 1 M 1 II 1 M 




Db 


550 


ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 


609 


Qy 


601 


CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


660 






II 1 M 1 II 1 1 II 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 II 1 1 




Db 


610 


CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


669 


Qy 


661 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 


720 






1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 III 1 II 1 II 1 1 1 II M Mill 1 II II II II Ml 1 II II Ml II 




Db 


670 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 


729 



Qy 


721 


Db 


730 


Qy 


781 


Db 


790 


Qy 


841 


Db 


850 


Qy 


901 


Db 


910 


Qy 


961 


Db 


970 


Qy 


1021 


Db 


1030 


Qy 


1081 


Db 


1090 


Qy 


1141 


Db 


1150 


Qy 


1201 


Db 


1210 


Qy 


1261 


Db 


1270 


Qy 


1321 


Db 


1330 


Qy 


1381 


Db 


1390 


Qy 


1441 


Db 


1450 


Qy 


1501 


Db 


1510 



TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 7 80 

1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 M 1 1 II I II II I III 1 1 III 1 1 III II 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 7 8 9 

ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 84 0 

M 1 1 II II I II 1 1 1 1 II I III 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 II II 1 1 ll 1 1 II 1 1 1 1 II II 

ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 84 9 

CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 900 

M II III! IMIM llllllllllll I Ml Mill I III I II II I MINIM Illllll 

CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 909 

CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 960 

I I I I I I I I I I I I 1 I I I I I II I I I II I I I I I II II I I I I I I I I I I I I I I I I I I I I I I IN 
CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATCCTC 969 

AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 102 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 

AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 102 9 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 Ml 1 1 1 M II II 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 108 9 

ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 114 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 114 9 

TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 1200 

II II II II II I I I I I I II II I I II I I II I II M I II II II II I I I II I I I I I II M II I I 
TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 12 09 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 12 60 

II 1 1 II 1 1 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II III II II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 Ml 1 1 1 II II 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 12 69 

AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 1320 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i E E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 132 9 

AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 13 80 

1 1 1 1 M 1 1 1 Ml 1 1 1 1 1 1 1 1 1 Mill II Mill Ml 1 1 1 1 1 1 II 1 1 1 Ml II 1 1 MM 1 1 

AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 13 89 

CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 144 0 

I I I I I I II I M II I I I II I I I I I I I II I I I I I I I I I I I II II I I I I I I I I I I II I I II I 

CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 144 9 

TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 150 0 

II 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 II II 1 1 II 1 1 II I II 1 1 1 1 1 1 1 1 1 1 II I II I M II 1 1 1 1 

TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 1509 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

II 1 1 1 1 M I M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M I II 1 1 1 1 II 1 1 1 1 II M M I 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1569 



Qy 


1561 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1620 






III 1 III! MM 1 MM Ml! MINI III 1 MM MM 1 MM 1 MINIM 1 MINI 




Db 


lj / U 


/*"irprp7v (~if~iT\ r T , P" 1, T"T , f'"' , / r "' , A TP" 1 A TP" 1 A nPATPP7\PPP7\ pn/-iiTi'Ti0 7\ /-iipi i u i \rn/-% A a rp/-irp/~i A Ti A <^trp a nvrinx rp 

Lrl lUUAi^Al^jAL-LAiljCAvaCjLATCTTLAL 1 1 1 1 VjAA 1 o 1 CjATACTATTCAT 


1629 


Qy 


1621 


GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 


1680 






MMIM MM 1 Mill llllllll MM lllllllll 1 MM 1 MM MM lllllll 




Db 


IOjU 


blL/i^jlbAbrtblAl IL7L7J.L7I 1 A 1 L7L7AL7L7 1 L AAL7L7 1 1C_ loLooALAlCAooTGCCCGGGGT 


1689 


Qy 


1681 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


1740 






III Mill 1 III 1 lllllllll MIMMMIMI Mill MM lllllllll MMIM 




Dh 


i con 

J. O u 


ALALj 1 LA 1 LLj 1 L, LLL. 1 1 1 AL7L7ALAL7 1 ALTAAovjLrALAuCLAACjvavjr I L7L7CGGTGAGGACTTT 


1749 


Qy 


1741 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGaAACTGTGAAAACCATAAGGGTT 


1800 






III Mill INI lllllllll llllll III Mill Mil! Mill lllllllll MINI 




USD 


± / O U 


(~* A APAPAPATA TPPPPAPTTPP A A TTP"< A 7\P7\ 7i TV* A mp AAA nmpmp a a A Tiom\mn 7v nnnmn i 

L7AAL7AL AL. A 1 A 1 LtLjLjLjALj 1 1 0L7AA 1 1 L AACjAA TCjAI GAAACTGTGAAaACC ataagggtt 


1809 


Qy 


1801 


aAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 






Ml III 1 1 1 1 Ml 1 1 llllllll 1 III Mill MM 1 MM 1 III 1 Mill MM! 






1 oi n 
lolU 


AAA ATZiPTBPATPAPPAPP A ATAPPA AAPPPAAPAPAA TTTP~ 1 T r PP''A rprp^-i >^i/^ir*ir r 11 1 ipprpp 71 7s 

-H-rt-rti-i I M\j 1 H.uH 1 0AL7L7AL7L7AA 1 ALoAAAvjLtLAALtALtAA i 1 1L1 1 LA 1 1 LrLL-L. I 1 L7L7 1 bAA 


1869 


Qy 


1861 


CCGAAATGGATGGAACGTGGAATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 


1920 






MIIIMIIMIIIIIIIMIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIII 




nh 


i fl7n 

lO / u 


PPfJZi Zi Zl TY^Cl A Tflf2 A SPPTPPfl ATATPAPATPTPAPAPAPAPPA a PpmP APTA mpp a AO A A 
LLbHHH 1 LtLjA ± L7L7 AALLj 1 L1L7AA 1 A 1 L, ALtA lbl VjAL ALjAL, AoLxAAvjL. 1 LjAL, 1 A 1 CjvjAAoAA 


1929 


Qy 


1921 


GAGGAGGCCAAGAGGATAGCAGAGATGGGaAAGCCAGTATTGGGTGAACACCCCAAACTA 


1980 






MMIIIMI Mill MM Mill MM Mill lllllll MM MMIIMMIMM 




Db 


1930 


GAGGAGGCCAAGAGGATAGCAGAGATGGGAaAGCCAGTATTGGGTGAACACCCCAAACTG 


1989 


Qy 


1981 


GaAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 


2040 






1 MM MM Mill MM Mill IIMIIMI MM III MM MMIMIMM MM 








P? A AP-TPATPA TTP" 1 7\7\P ApmpprnTimp a nmmri a a /~i a z^rp a PPPHnP A O 7\ A 7\ /"irp/-i A rn/~i 7\ a /*» A a 

oaavj i LA 1 LA 1 1 L7AAL7AL7 ILLlAi L7AL7 1 1 LAALxAL 1 ALLAj 1 CjCjALAAACTGATCaAGaAG 


2049 


Qy 


2041 


acaaacctggccttggttgtggggacccattcctggagggaccagttcatggaggccatc 


2100 


nk 


jU 


1! 1 1 1 1 1 M 1 1 1 1 1 1 MM 1 1 1 II III III Mill 1 1 II 1 II II 1 1 1 II 1 II II II 1 II 

]\PAA A P^P^TOP" 1 P" , P" , TTP" , P ,,, TTP" , T/'*'P"''P"''Pt A f~* /""i /""* A T""p^"i/"inpfi/"'i 7\ ^1^1 /~i a a pmrnp a rp/""i/"*» 7\ /-» /~i /"t 7\ m r~\ 

ALAAALLILtLjLlI ICjCjI ICjTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 


2109 


Qy 


2101 


ACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCC 


2160 






llllll IIMIIMI MM Mill lllllllll MM lllllll III IIMIIMI MM 




nh 


nil n 
Z ± ± U 


ALLL7 1 LALr 1 LtLAoLALoooA 1 L7AL7L7A J. bAbCjArGAATCCGGGGAGGAGAGGCTGCCCTCC 


2169 


Qy 


2161 


TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 


2220 






Ml 1 1 1! II 1 1 1 1 1 1 Mill 1 1 II II 1 M II II M 1 1 III II II II 1 II 1 II II II III 1 




i-JU 


Z ± / u 


TP^P^TTTP** A PT A PP TP7\ TPPTl nmmppmp a /~irp/~i rp /-irprp/- , irT"i/~i /~i a A /**• rp/"i /^rTi/^ rprnrn/~» /~i /-im/"i r-ri/—i rn/~i 

IVjLI 1 1 bAL 1 ALb 1 L A 1 bt AL I ILL ICjAL ICjIl L I L 1 (jCjAAGGTGCTGTTTGCCTGTGTG 


2229 


Qy 


2221 


CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 


2280 






lllllll 1 III Mill 1 MM Mill III Mill 1 MM MM 1 lllllllll Mill 




nk 


o o *3 a, 

ZZ J U 


LlllllAlALxAGTAlTGLlACGGlT^ 


2289 


Qy 


2281 


ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 


2340 






1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 E 1 1 1 M 1 1 [ 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 






Z Z J u 


ATPPTPAPPPPPATPATTPPPPAPPTPPPPTPPPAP'PTPPPPT'PPAPPA'PTPPTP'Pn^ a a 

AILjL 1 LALLbLLAlLAl IL7L7L7L7ALL ILAjLL 1 LLsLAL 1 1 LL7L7L J.L7LALLAI I L7L7TCTL.AAA 


2349 


Qy 


2341 


GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 


2400 






MM 1 1 1 1 1 1 1 II Mill 1 II II III IIIIMIMI 1 1 1 II 1 1 1 1 II 1 III II II 1 II 1 




Db 


2350 


GATTCGGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 


2409 


Qy 


2401 


AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 


2460 



Db 


2410 


Qy 


2461 


Db 


2470 


Qy 


2521 


Db 


2530 


Qy 


2581 


Db 


2590 


Qy 


2641 


Db 


2650 


Qy 


2701 


Db 


2710 


Qy 


2761 


Db 


2770 



llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 2469 
AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 2520 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 252 9 
TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2 580 

I III 1 1 1 II 1 1 1 II I II II I II 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II II I II II 1 1 III 1 1 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2 589 
CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 2 64 0 

I MM INI I III! I llllllll I INI I lllllllllllll I Mill MINIMI I II 

CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 2 64 9 
CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 27 00 

I Ml! 1 1 M 1 1 1 II 1 1 1 II III 1 1 III 1 1 1 1 Ml 1 1 MM II I II II I II MM 1 1 1 1 M 

CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 2709 

AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2760 

I I I I I I I I I I 1 I I 1 I I I I I I 1 I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I 
AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2 769 



MINI 



ABQ78865; 

09-OCT-2002 (first entry) 

Human ion exchanger protein #1 cDNA GCA mutant. 

Human; ion exchanger protein; NHIEP; nootropic; cytostatic; gene therapy; 
antiarthritic; virucide; chemotherapeutic ; cancer; arthritis; antiviral; 
gene ; s s ; mutant . 



RESULT 9 
ABQ78865 

ID ABQ78865 standard; cDNA; 2769 BP. 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
XX 
OS 
OS 
XX 
FH 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 



Homo sapiens . 
Synthetic . 

Key 

mutation 



Location/Qualifiers 
replace (2113 . .2115, -) 
/*tag= a 



WO200259316-A2 . 
01-AUG-2002 . 

22- JAN-2002; 2 002WO-US001817 . 

23- JAN-2001; 2001US-0263384P . 



PA (LEXI- ) LEXICON GENETICS INC. 
XX 

PI Friddle CJ, Hilbun E; 

XX 

DR WPI; 2002-599791/64. 
XX 

PT Novel polynucleotides encoding human ion exchanger proteins that are 

PT structurally related to mammalian sodium-calcium exchanger proteins, 

PT useful for drug screening, diagnosis and in gene therapy of biological 

PT disorders. 
XX 

PS Disclosure; Page; 42pp; English. 
XX 

CC The invention relates to a novel human ion exchanger protein (NHIEP) , 

CC that shares structural similarity with mammalian sodium-calcium exchanger 

CC proteins, and potassium dependent versions of the same. The NHIEP of the 

CC invention has nootropic, cytostatic, antiarthritic , and virucide 

CC activity. The polynucleotide may have a use in gene therapy. NHIEPs can 

CC be targeted by drugs, oligos, antibodies etc., in order to treat disease 

CC or to therapeutically augment the efficacy of chemotherapeutic agents 

CC used in the treatment of cancer, arthritis, or as antiviral agents. The 

CC sequence represents a mutant form of a NHIEP of the invention. Note: The 

CC present sequence is not shown in the specification but is derived from 

CC the human NHIEP sequence shown as SEQ ID 1 (ABQ78861) 
XX 

SQ Sequence 2769 BP; 656 A; 679 C; 761 G; 673 T; 0 U; 0 Other; 

Query Match 99.5%; Score 2753; DB 6; Length 2769; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2766; Conservative 0; Mismatches 0; Indels 3; Gaps 1 
ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 6 0 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 6 0 
TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 12 0 

MM III 1 1 1 Ml I MIMI MM 1 1 INI II INI I III 1 1 1 MM Mill MIIIM 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 12 0 
ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 180 

MM MM 1 1 MM I MM I Mill I MIMI Mill MIMI MM I MM I llllll 

ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 18 0 
CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 24 0 

I II 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 IIIMMI II II 1 1 1 1 1 1 1 1 INI 1 1 II I II 1 1 1 II 1 1 1 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 24 0 
TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 3 00 

IMMMMM MM MMIIIMIIIIMI MINI MIMI I INI MMI M IIMI 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 
TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 3 60 

1 1 1 1 J 1 1 1 1 1 1 1 J r 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 j 1 1 j 1 1 j 

TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 360 
ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 42 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Qy 


i 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 



Db 



361 ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 42 0 



Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


L)D 


lUol 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 



CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 4 80 

llllllll I III II III I IIIIIIIMIIII MM I MM IIIIIIIIMIII I Mill I 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 4 80 
ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 540 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 

ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 54 0 

ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 600 
I I I I I I M I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I II I I I I I II I I I I I I I II I I 
ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 600 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 f 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 
CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 72 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 72 0 
TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

II 1 1 1 1 MM 1 1 1 1 1 1 1 MM 1 1 Ml Mill MM I II II 1 1 M 1 1 1 II II II I II I II I 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 7 80 
ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 84 0 

MMMMMMMMMMMMMMMIMMMMMMMMMMMMMMI 

ATGCACAAAAAGTAC CG CAC AGACAAAC ACCGAGGAATTATCATAGAGACAGAGGGTGAC 84 0 

CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 900 

CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 960 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 M 

CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 960 
AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCT^AT 102 0 

IMMI MMIIIMM IIIIIMMMI III! Mill MM MIM llllllllllll 

AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 102 0 
TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M I i 1 1 1 1 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 
ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 114 0 

II 1 1 MIM I III II II 1 1 1 1 Mill I M 1 1 Ml I M I MM 1 1 1 II II II III Ml 1 1 1 

ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 114 0 
TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 1200 

M I II I II M M I II 1 1 1 Ml 1 1 1 1 II MM MM II 1 1 1 1 III I II Ml I Ml II II I 

TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 12 00 
GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 

II I Mill 1 1 Ml M M 1 1 1 1 MM 1 1 II M 1 1 1 1 II 1 1 III 1 1 Ml II III 1 1 II I II I 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 



Qy 


1261 


AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 


1320 






I 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 

II 1 1 M M M M M II II 1 II II II II II II II II 1 1 1 1 II 1 1 II 1 II II II 1 1 II II II 




Db 


1261 


AAAGGGGGAGAPATGTCAAAGACPATGTATPTPfiAPTAPAAAAPAnAPPATPPTTPTnPP 




Qy 


1321 


AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 


1380 






I 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 

II II 1 1 1 1 1 1 1 M 1 II 1 1 1 1 II II 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I 




Db 


1321 


AATGCAGGGGCTGACTATGAGTTCACAGAGGGPAPGGTnGTTPTGAAnPPAPnAn annrr 


1 ft 0 


Qy 


1381 


CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 


1440 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M 1 1 1 1 1. 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I I I M M M | | 




Db 


1381 


CAGAAGGAGTTCTCCGTGGGCATAATTGATGArGArATTTTTnAGPAfinATnAArArTTr 


1 A A 0 


Qy 


1441 


TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 


1500 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 M M II 1 II 1 1 II 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I | | 




Db 


1441 


TTTGTAAGGTTGAGPAATnTPPGPATAPAnPAPnAPPAPPPAPAPPAPPPPATPPPTPPlv 


IDUU 


Qy 


1501 


GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 


1560 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 
1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 II 1 1 II II 1 II 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 II 




Db 


1501 


GCAATATTCAACAGTCTTPPPTTGPPTPGGGPTGTPPTAPPPTPPPPTTnTnTPPPPl\PA 


lOD U 


Qy 


1561 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1620 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I | | | | | | 1 | 




Db 


1561 


GTTACCATCTTGGATGATGAPPATGPAGGPATPTTPAPTTTTnA ATPTPATAPTATTPAT 


luZU 


Qy 


1621 


GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 


1680 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1621 


GTCAGTGAGAGTATTGGTGTTATnGAGnTPAAPPTTPTPPOnAPATPAPPTPPPPPPPPT 


1 £R0 


Qy 


1681 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


1740 






1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 M 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 




Db 


1681 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGAPAGPPAAGGGTGGPPPTnAGGAPTTT 


1 740 


Qy 


1741 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 


1800 






i i 1 1 i ii 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
l ll I I II II 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1741 


GAAGAPAPATATPPPGAPTTPnAATTPAAPA ATPATPAZV APTPTPA AL\ APPATBAPPPTT 


1 Qfifl 

loUU 


Qy 


1801 


AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 






IMIIIIIIIIIIIIMI MINI MIIIIIIIIIIIIIIIIIIIIIIIIMI! MINI 




Db 


1801 


AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 


Qy 


1861 


CCGAAATGGATGGAACGTGGAATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 


1920 






1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I I I I i I i I i i i i i i i i i i t i i i i i i i i i i i 

II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II II 1 II 1 1 1 II 1 1 1 1 1 1 II 1 II 1 II 1 1 1 II II 




Db 


1861 


PPGAAATPPATPGA APPTPPA ATATPZ\PZ\TriTriZiPaPZiP2iPPi\z\r , r , T , par ,r pa r rr , r , a apt\ a 


ly^s u 


Qy 


1921 


GAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTGGGTGAACACCCCAAACTA 
1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l l l l l l l I I I i I i I 


1980 


Db 


1921 


1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II II II 

OAPPAPPPPA APAPPATAPPl\Pl\PlVTPPPZi Z\ 2\HPPflr!TZiTTr i rr i TP A BPAPPPPAA A r ir P7\ 


1 Q O A 


Qy 


1981 


GAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 


2040 






1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II ! 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 




Db 


1981 


GAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 


2040 


Qy 


2041 


ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 


2100 






Mill II II MM Mill 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 Mill Mill II II II II II 




Db 


2041 


ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 


2100 



Qy 


2101 


ACCGTCAGT GCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCC 


2157 






MINIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 




JJD 




ACCGTCAGTGCAGGAGGAGGGGA 1 GAGGA IGAGGATGAATCCGGGGAGGAGAGGG I GGGG 


2160 


Qy 


2158 


TCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGT 


2217 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




nu 

JJU 


zlbl 


iGGiGGl 1 1 bAL i ALb 1 1 A 1 bL AL 1 1 LL I GAL 1G 1 G J7TGTGGAAGG IGC TGTTTGCCTGT 


2220 


Qy 


2218 


GTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATT 


2277 






IMMMI MMM Ml MM III M IMM MMMMMIMM 




nk 
JJD 


O O O 1 


PTPPPPPPP A/" 1 A* r " 1 Af , T , A PTP pp7\ pipipipimpipipipipinipipimmpipipipipirnpirnp"ipi a morimOTv mnn mm 
la 1 bLLLLL LAUAbAb 1 AL 1 bLLAbbbt 1 GGGGG 1 GG I TGGGGGTGTGCATCCTCATCATT 


2280 


Qy 


2278 


GGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTC 


2337 






M MM MM M 1 M M M M M MM MM M M MMMM M M MM MMMMI 




JJD 


o o o n 


PtPl /"I A rpP^Pirppi 7\ /"i/"i/*i /"i /"I 7V rp/i TV mmA /"i /"i /"i 71 yi yirn/"l /"** /*l/im/"l/l /I -iv /"i m m /~i /*i /*i /*im/i /1 tv /"i ft tv rnm/*«/^m/im/i 

GGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTC 


2340 


Qy 


2338 


AAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTT 


2397 






MMI MMM MMIIMM MMIIMMI MMMMM M MMMMMMMM 




nk 
JJJJ 




A A APA T"T 1 ^* 1 A f ,|r Pf A f"" 1 A f'T'Pi r P r PPi T° ,r PT""PPipi r PP' , pi pi A rprprn/~i /"i /"t j\ /-1 /"irpfirp/'i rp/'i/'i /-1 71 /"i t\ rp 7\ /lommm 

AAALjA 1 1 LAb 1 LALAbL ibllbllll GG 1 GGGA 111 GGGAGG 1 G 1 G 1 LCGAGATACGTTT 


2400 


Qy 


2398 


GCCAGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACG 


2457 






1 1 1 1 1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 i 1 1 1 1 1 1 1 




UD 


z4 U ± 


pi pi pi 7\ pi pi A A A/" 1 P^T^Pl pi rppl /"i ^"1 /'i T 1 /'I pi A Pi pi A rp/*i rp tv rp 7V rp/*l /"i 71 /-1 71 pr( ppmppn rrirri/1 /"I /"l 7v 7V ✓I /I rn/"l 7v fl/*1 

GGGAGG AAAGG 1 GG 1 GGGG I GGAGGA 1 GTA 1 A 1 GG AGAGGG CTCC ATTGGCAACGTGACG 


2460 


Qy 


2458 


GGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATC 


2517 






IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




JJD 


z4ol 


/"I /l /—i 7\ /"i /"i TV tv /*i /"i /"i /"I /"I rp /"I 7\ tv rp/"irp/ii I n I ippmppp /"I tv mpppp/*imp/™i /i/im/'iyifn/i/'i^irn/i/i /*i/*i/"i /"i<"i n m /"i 

GGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATC 


2520 


Qy 


2518 


TACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTC 


2577 






M 1 1 M M M M M M M M M M M M MM M M Ml M 1 M M 1 1 M M M 1 1 M M 




nk 

JJD 




T> A PTPPPPTPTPPAPPP A PAPP 7\pmmppTi pipi*"PPi'"PPipipipipiP , "»P , '«P ,, « A Pi A /'I'T'fi/'i/'i/'irprp/irpO/if'irp/i 

1AL 1 GGGG 1L1 GGAGGGAGAGGAG 1 1 GGAGG IG 1 GGGGGGGGAGAG 1GGGGTTCTCCGTC 


2 580 


Qy 


2578 


ACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCG 


2637 






M M 1 1 M 1 M M 1 1 1 M M 1 1 1 Ml 1 M 1 M M 1 II 1 1 1 M 1 M II 1 1 1 1 II 1 M M M 




nk 
JJD 


Z Jul 


A fi pi pi I" 1 pi T" 1 1 -1 pi A pi pi A rp/irprn rp/~t pi tv rpr-pHiz-i rp/-irp/"i pi A rp/~i 7V r"i P/imn/im/irnrn/irnTi /*1/"1/1 7\ 7\ r\r> /~i/~trt rtfift 

AGGG1G1 1GAGGA1G1 1 1GGA1 1 IG1GTGGATGAGCGTGCTCTTGTACCGAAGGCGGCCG 


2640 


Qy 


2638 


CACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTT 


2697 






1 M 1 1 II 1 1 II M 1 M 1 1 MMM IMMIIMI Ml 1 1 M 1 M II 1 1 II 1 1 M 1 1 MM 




Db 


2641 


CACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTT 


2700 


Qy 


2698 


GTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAG 


2757 






1 MM 1 M 1 M M 1 M II 1 M 1 M II 1 M III M 1 M 1 1 1 M 1 1 1 1 M M II 1 M 1 M M 




Db 


2701 


GTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAG 


2760 


Qy 


2758 


GGGTTCTAA 2 7 66 




Db 


2761 


MMMMI 

GGGTTCTAA 2 769 





RESULT 10 
ABQ78866 

ID ABQ78866 standard; cDNA; 2769 BP. 
XX 

AC ABQ78866; 
XX 

DT 09-OCT-2002 (first entry) 
XX 



DE Human ion exchanger protein #1 cDNA A/G+GCA mutant. 
XX 

KW Human; ion exchanger protein; NHIEP; nootropic; cytostatic; gene therapy; 

KW antiarthritic; virucide; chemotherapeutic; cancer; arthritis; antiviral; 

KW gene; ss; mutant. 
XX 

OS Homo sapiens . 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 

FT mutation replace ( 1889 , A) 

FT /*tag= a 

FT mutation replace (2113 . .2115,-) 

FT /*tag= b 

XX 

PN WO200259316-A2 . 
XX 

PD 01-AUG-2002. 
XX 

PF 22-JAN-2002; 2002WO-US001817 . 
XX 

PR 23-JAN-2001; 2 001US- 02 63 3 84P . 
XX 

PA (LEXI-) LEXICON GENETICS INC. 
XX 

PI Friddle CJ, Hilbun E; 

XX 

DR WPI; 2002-599791/64. 
XX 

PT Novel polynucleotides encoding human ion exchanger proteins that are 

PT structurally related to mammalian sodium-calcium exchanger proteins, 

PT useful for drug screening, diagnosis and in gene therapy of biological 

PT disorders. 
XX 

PS Disclosure; Page; 42pp; English. 

XX 

CC The invention relates to a novel human ion exchanger protein (NHIEP) , 

CC that shares structural similarity with mammalian sodium- calcium exchanger 

CC proteins, and potassium dependent versions of the same. The NHIEP of the 

CC invention has nootropic, cytostatic, antiarthritic, and virucide 

CC activity. The polynucleotide may have a use in gene therapy. NHIEPs can 

CC be targeted by drugs, oligos, antibodies etc., in order to treat disease 

CC or to therapeutically augment the efficacy of chemotherapeutic agents 

CC used in the treatment of cancer, arthritis, or as antiviral agents. The 

CC sequence represents a mutant form of a NHIEP of the invention. Note: The 

CC present sequence is not shown in the specification but is derived from 

CC the human NHIEP sequence shown as SEQ ID 1 (ABQ78861) 
XX 

SQ Sequence 2769 BP; 655 A; 679 C; 762 G; 673 T; 0 U; 0 Other; 

Query Match 99.5%; Score 2751.4; DB 6; Length 2769; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2765; Conservative 0; Mismatches 1; Indels 3; Gaps 1 
Qy 1 ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 !l II ! I II 1 1 1 1 1 

Db 1 ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 



Qy 


61 


TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 


120 






I 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 t 1 

II M M i M 1 I 1 M 1 M 1 I 1 M 1 II M 1 1 II 1 II II 1 II 1 II II II II M II II 1 II 1 




Db 


61 


TTTfyrfiPTPTTPPTPAATnnTPTTPPAPPAPAnPPTnPTPPPTP APPPP 


1 0 n 


Qy 


121 


ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 


180 






1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 I 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 i t 1 1 1 1 1 1 1 t 
1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 II 1 1 II 1 1 1 II II II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II II 1 II 




Db 


121 


APAPPPPAPAAPAATnAPTPPTPTTPAPpnTPATPPPAPTpPA a a ndnT^Tr a tpptp 


1 Q Pi 

ioU 


Qy 


181 


CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I I I I I I I I I i i i i i i i i j 
1 I 1 II 1 1 1 1 1 1 1 1 1 1 1 I II I I | || || | | | || || | || | | 1 | | | | | | | | | | | | M 1 1 1 1 1 1 1 




Db 


181 




O A A 


Qy 


241 


TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 


300 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I i I 

II M '1 1 II 1 1 1 M 1 1 1 II II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I 




Db 


241 


TTTGTGGC C CTG AT AT A PATPTTPPTTPPPPTPTPP A TP A TTPPTP A PPPPTTP A tttt* A 


1 pi n 
JUL) 


Qy 


301 


TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t l l l l l l l I I I I I I 
1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 II 1 I I I I II || || || 1 M | | | | | 1 | 




Db 


301 


TPTATTPA APTPATPAPPTPTPA APAPAPPP anrTP apt\ atta ap* a a apppa t\ fpp 7\r*T\ a 
■l ^- J- x x urtrtu x v_.£-i x v_i-i.\_ v_ X v_ X LHrtuHuH>jooHolj X uALAH X 1 AAbjAAAb, L, LAA x LjbjAbiAA 


360 


Qy 


361 


ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 


420 






1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I l l l l l l l l l I I I I I I I i i i i i i i i i i i i i i i i i 

! ! M ! 1 1 M 1 M 1 1 1 II 1 1 1 1 II II 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 II II 1 II II 1 1 II II 1 M 




Db 


3 61 


APPAPPAPA APP APTAT'TPPPPTPT'PPZi A. TP* A A Aprppmpmppji APPTPA ppprnmii rn/~ii~* /~* 


420 


Qy 


421 


CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I I I I I I I i i i i i i i i i ii i i i 
1 1 1 1 1 1 II 1 1 M 1 1 II 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


t~ -lovjox i^^-L^ X\j^X\-V^ HjAuHIHLILL 1L1L1 X X bjAbxbj 1 bj X \j X VjU 1 \- A 1 vjVjo 1 xb. 


A O C\ 

4 80 


Qy 


481 


ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 


540 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l l l l l l l I I I I I I I i i i i i i i i i i i i i 

II 1 1 1 1 II 1 II 1 1 1 1 1 1 II 1 1 1 II 1 I I || I I I I | | | | | | | | | | | II | | | | | | | | | | 1 | | | 




Db 


4 81 


ATTPPTPPiTPaTPTPPP.ZiPPTTPTQPPaT'TPTaPPPaPTPPaPPPTTPA apatpttpatp 
x x x uuiortiL xooo/\v_b. lit 1HL.LA1 Xbi 1 Auuu/iu 1 bjb.Aob»b, 1 1 LAALAlbl 1 L.A1 L. 


54 0 


Qy 


541 


ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 


600 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I I I 1 1 1 1 l l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M 1 M I M M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 II II 1 1 1 1 II 1 II 1 1 II 1 1 1 1 1 II 1 1 I I I 1 




Db 


541 


ATPATTPRPATPTPTPTPTZiPPTPaTPPPapaPPPnpapaPTPPPA apatpa appatpta 
rt >- L *~**x 1 uutrt X v^. xoXvjXL IHculuAl L L L AuAL ub Au Au AL 1 LbLAAuA 1 b, AAbjb,A 1 U 1A 


/z rs r\ 

600 


Qy 


601 


CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


660 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 II M 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 II M 1 




Db 


601 


PRAPTPTTPTTPATPZ\PPPPTPPTTPPQPTa r PPTTTPPP r PAPA r rp f T l PPPT'pnnATATPA «T>nn 


660 


Qy 


661 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 


720 


Db 




1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 i 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 I I I I I I 1 1 1 1 1 1 i 1 1 1 1 1 

1 1 1 1 M 1 ! 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




O O X 


PTPPPZ\PTPT r PPTPPPP r rPPTP l T l PP r PPPAPPT ir P r rr'nPA A /-'»/"• /"i/"in"i/ , "i^irn/'"i7V /nm/imnn» urti 1 nn« 

± boLAu x b. x iLlLbLL xbAjlbjlbA? xL,b,AbA7X 1 IQjCjGAAGGCCTCCTCACTCTCTT CTTC 


720 


Qy 


721 


TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 


780 


Db 




1 1 1 1 1 1 I 1 1 1 1 l l l l l l 1 1 1 1 1 t 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 t 1 1 1 1 1 i 1 i 1 1 1 

1 M M 1 1 1 II 1 1 1 1 1 1 1 1 1 I 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




/ Ct X 


'P'PTPPZiPTPTPTPT'PPTTPTPPPPTPPPT'PPPAP A T> a a 7\ r*r* a nfTinnmrifrmnrmv /-i7\ 7\ nmivo 

x x xv_b,AbjXbjXbj xlj x Lb 1 x b» xbjbibb, xbAjbi xUvjL.AvjA1 AAALGACTGCTCT TCTACAAATAC 


780 


Qy 


781 


ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 


840 


Db 




1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 Mill II 1 1 1 II 1 II 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 




781 


ATGCACAaAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 


840 


Qy 


841 


CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 


900 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


841 


CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 


900 



Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 



CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 960 

II! II ' 1 1 J I III, 1 1 1 . 1 1 i . 1 1 1 : , 1 1 . 

CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 960 
AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 102 0 

M M Mill Mill I Mill MM I MINIM I MM MM MM llllll MM II 

AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 102 0 
TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

MM: II MMMM MM Ml Ml MM MIMIII IMIMII III 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 108 0 
ATGATGACTGGTGCAGGCAAT ATCCTGAAGAAACATGCAGCAGAAC AAGCCAAGAAGGCC 114 0 

1 1 1 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 

ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 114 0 
TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 1200 

MM MM MM M IIMIII MIMIIII IMIIII ill III Ml 

TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 12 00 
GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 

I II II I! I II I! 1 1 II ! II II III II III MM II 1 1 II MM II II II II II ! II 1 1 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 
AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 132 0 

1 1 II 1 1 II 1 1 Ml 1 1 1 II 1 1 1 1 III I M Mill 1 1 II 1 1 1 III I II II 1 1 1 1 1 1 1 1 1 1 1 1 

AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 132 0 
AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 13 80 

1 1 1 II I M II 1 1 1 1 1 1 1 1 1 II I II II 1 1 II I II II 1 1 II I II II 1 1 1 III I II M 1 1 1 1 1 

AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 13 80 
CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 1440 

I II II I M II 1 1 1 II I M M 1 1 II 1 1 II II 1 1 1 1 1 II I II I M 1 1 1 1 1 1 II II II 1 1 M I 

CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 144 0 
TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 15 00 

1 1 II II 1 1 1 1 M 1 1 II M I II 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 M I M 

TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 15 00 
GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

III Mill MM Mill III Mill I MIMIII MM Mill MM I Mill III 1 1 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 
GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 162 0 

I Mill 1 1 1 II I II I II I III I II 1 1 II II 1 1 1 II I M 1 1 1 1 1 II I Ml MM II II 1 1 1 

GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 162 0 
GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 1680 

1 1 1 M I II II 1 1 II II I II M I II I M II I M 1 1 1 II II I II M II II I II 1 1 II 1 1 1 1 1 

GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 1680 

ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 1740 

M I M I I I I I II I II I I I II M I I I I II I II I I II I I I I I I I I I II M I II II I I I I I I 
ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 174 0 



Qy 



1741 GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 1800 



MMIIMMMMMMMMII IMIIIMIMIMM IMIMMMMIIMIM! 

3AAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 1800 



Db 


1741 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAAPTC 


1800 


Qy 


1801 


AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 
1 II II 1 II M II II 1 i 1 M 1 II 1! I 1 II 1 1 II II II 1 II 1 1 II 1 II 1 1 II II 1 II 1 It M 




Db 


1801 


AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 

******** x xxw x xiN^jii x x-i*^i.\.pj v*»**^— * ^^**** x ** v-« v^***^**^s\^ vruiunvjriA x x x v^> x x Vw»^j. x x www w x x \J\J x uriTi 


1860 


Qy 


1861 


CCGAAATGGATGGAACGTGGAATATCAGATGTGACAGACAGGAAGCTGACTATGGAAGAA 


1920 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 
1 II 1 1 1 1 1 1 1 II 1 II II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1861 


CCGAAATGGATGGAACGTGGAATATCAGGTGTGACAGACAGGAAGCTGACTATGGAAGAA 


192 0 


Qy 


1921 


GAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTGGGTGAACACCCCAAACTA 


1980 






1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 II 1 II II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1921 


GAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTGGGTGAACACCCCAAACTA 


1980 


Qy 


1981 


GAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 


2040 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 II 1 II 1 1 1 1 II 1 1 1 




Db 


1981 


GAAGTC ATCATTGAAGAGTC CTATGAGTTC AAGAC TACGGTGGACAAACTGATC AAG AAG 


9 04 0 
* \j "± \j 


Qy 


2041 


ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 


2100 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 II 1 II II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2041 


ACAAACCTGGCCTTGGTTGTGGGGACCCATTC^TG^AG^GA^^AGTT^AT^^A(^^rPZ\Tr , 


9 1 no 


Qy 


2101 


ACCGTCAGT GCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAGAGGCTGCCC 


2157 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II 1 1 1 1 1 1 1 1 1 1 1 I I I I || I I 1 I I I | | | | | || | | | M 




Db 


2101 


ACCGTCAGTGCAGCAGCAGGGGATGAGGATGAGGATGAATrrnnnnAnGAriZVririrTnPPP 


9 i n 


Qy 


2158 


TCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGT 


2217 






1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


2161 


TCCTGCTTTGAPTAPGTPATnPAPTTPPTGAPTGTPTTPTGnAAGPTPPTnTTTnPPTPT 

x v_ \_» x \_j v-. xxx uriVw x riVwVj x v_.^v x vj^,riVu X X \_ \_ X N_J.rt.V_. X \J X v_» X X \_ X OV_J_r__r_.w\J X Ov_ X VJ XXX \_J V_ \_ X \J X 


9 99 n 
z z z u 


Qy 


2218 


GTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATT 


2277 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


2221 


GTGCCCCCPAPAGAGTAPTGPPAPGGPTnGnPPTGPTTPGPPnTPTPPATPPTPATPATT 


9 9 ft 0 
z z o u 


Qy 


2278 


GGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTC 


2337 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2281 


GGCATGCTCACCGCCATCATTGGGGAPPTGGPPTPGPAPTTPGGPTGPAPPATTnGTPTP 

lv Jvi v<f.\»v_\j\*\-r_ x v_._t x x v_»vj\jv_»./-_.v». v_ x v_j\_j v— v_» X v_»\_7\_».f^\ — x X \_ \_jv_j v_ X V_J\_»./A\ \^r\ X X VJVJ 1 v_ 1 L 


9 "*4 0 


Qy 


2338 


AAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTT 


2397 






1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2341 


AAAGATTCAGTCAPAGCTGTTGTTTTPGTnnPATTTGGPAPPTPTriTPPPAPATAPnTTT 


9 4 fi 0 
Z ft U U 


Qy 


2398 


GCCAGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACG 


2457 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 l l l l l l l l l l l l I I I 
1 1 1 1 1 II 1 II 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 II 1 II 1 1 1 1 II 1 II 1 II II II II 




Db 


2401 


GPPAGPAAAGPTGPTf5PPPTPPAPnATPTATaTf^Pi\P2_ 1 ppppTPPi\T , T , r:r'r , a ar'r'^r'ar 1 ^ 

VJ v_- v_._^\_7\_,_^xtjtA.nj v.. x UL X v_J v_, V_» v_ X ^^nVJOr. X vj Inlnl \_l\_,.rt\_7/-i.V_,\_j\_. V_ X V_V_i-_ X X VJv_xV_-r^_H\«.Vj X \_jiH.\_, \_J 


z 4 b U 


Qy 


2458 


GGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATC 


2517 






1 1 1 1 1 1 1 I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I 




Db 


2461 


GGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATC 


2520 


Qy 


2518 


TACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTC 


2577 






1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 INI 1 II! II 1 1 1 1 1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 MM 1! 




Db 


2521 


TACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTC 


2580 


Qy 


2578 


ACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCG 


2637 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 



25 81 ACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCG 264 0 



Qy 2 63 8 CACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTT 2697 

1 1 1 1 1 II 1 1 1 III 1 1 1 Mil 1 1 1 1 1 1 1 1 1 II I Mill MM 1 1 1 II II I III 1 1 1 1 II 1 1 

Db 2641 CACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTT 2 700 

Qy 2698 GTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAG 2 757 

Ml I MM 1 1 MM 1 1 Mill II II 1 1 Mill Mill 1 1 II I II 1 1 II 1 1 II 1 1 II II II 

Db 2701 GTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAG 2760 

Qy 2758 GGGTTCTAA 2766 

IMIIIMI 
Db 2761 GGGTTCTAA 2769 



RESULT 11 
ABA04756 

ID ABA04756 standard; cDNA; 2781 BP. 
XX 

AC ABA04756; 
XX 

DT 25-FEB-2002 (first entry) 
XX 

DE Human natrium ( + ) -calcium (2+) exchanger form 3 protein, HNCX3, cDNA. 
XX 

KW Human; Natrium (+) -Calcium (2+) exchanger form 3; HNCX3 ; chromosome 14; 

KW cardiac failure; myocardial infarction; cardiac hypertrophy; arrhythmia; 

KW myocarditis; pulmonary hypertension; cardiotoxicity ; cardiant; Vaccine; 

KW coronary heart disease; renal failure; ischaemic disorder; 

KW Antiarrhythmic; Vasotropic; Hypotensive; cardiovascular disorder; ss. 

XX 

OS Homo sapiens . 



XX 

FH Key Location/Qualifiers 

FT CDS 1. .2781 

FT /*tag= a 

FT /partial 

FT /product= "Human natrium (+) -calcium (2+) exchanger form 3 

FT protein, HNCX3 " 

FT /note= "No stop codon given" 

XX 



PN WO200183744-A2 . 
XX 

PD 08-NOV-2001. 
XX 

PF 30-APR-2001; 2001WO-EP004886 . 
XX 

PR 02-MAY-2000; 2000EP-00109080 . 
XX 

PA (MERE ) MERCK PATENT GMBH. 
XX 

PI Wilm C; 
XX 

DR WPI; 2002-041493/05. 
DR P-PSDB; AAM47745. 
XX 

PT New polypeptide, useful as vaccines for inducing immune response against 



PT diseases such as myocardial infarction, arrhythmia, ischemic disorders, 

PT renal disorders in mammal. 

XX 

PS Claim 4; Page 34-38; 41pp; English. 
XX 

CC The present sequence is the coding sequence for human Natrium (+) -Calcium 

CC (2+) exchanger form 3 (HNCX3) . The HNCX3 gene maps to human chromosome 

CC 14 . HNCX3 and its coding sequence are useful for treating acute and 

CC chronic cardiac failure of different aetiologies, myocardial infarction, 

CC cardiac hypertrophy, arrhythmia, myocarditis, pulmonary hypertension, 

CC cardiotoxicity (e.g. induced by chemotherapy), coronary heart disease, 

CC acute and chronic renal failure, ischaemic disorders of skeletal muscle 

CC and ischaemic brain disorders of different aetiologies 
XX 

SQ Sequence 2781 BP; 658 A; 678 C; 765 G; 680 T; 0 U; 0 Other; 



Query Match 98.8%; Score 2733.4; DB 6; Length 2781; 

Best Local Similarity 99.3%; Pred. No. 0; 

Matches 2762; Conservative 0; Mismatches 1; Indels 18; Gaps 1 



Qy 


i 


ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 


60 






hlllillll IMMIII llllllll IIIMIII llllMllillllllllllll) 




Db 


i 


ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 


60 


Qy 


61 


TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 


120 






1 1 1 1 1 1 II Ml 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 MM 1 III II II 1 II II 1 II 1 II 1 1 1 




UD 


b ± 


TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 


120 


Qy 


121 


ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 


180 






1 1 1 Ml 1 1 Ml 1 1 1 Mill 1 1 1 1 II II 1 II 1 1 1 1 1 II III M MM 1 M Mill 1 MM 1 




UD 


1Z 1 


ACACjOCjLAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 


180 


Qy 


181 


CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 


240 






1 1 II 1 M 1 II 1 M 1 M II M 1 1 M 1 II 1 1 1 M M 1 1 1 M II 1 1 II 1 M M II M II M M 




Db 


181 


CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 


240 


Qy 


241 


TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 


300 






Mill lllllllil IliMIIIMIIIIillMIIIIMIMIIII IMIIIIIMII 




Db 


241 


TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 


300 


Qy 


301' 


T CTATTGAAGT CATCAC CTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 


360 






1 M 1 1 1 II M II M M II 1 1 1 M 1 M 1 M 1 II II 1 1 1 1 II 1 1 M 1 1 Ml II II 1 1 1 M M 




Db 


301 


TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 


360 


Qy 


361 


ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 


420 






MM II MM II Ml MMMIMMII IIM III III! IIMMIM 




Db 


361 


ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 


420 


Qy 


421 


CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 


480 


Db 


421 


1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 


480 


Qy 


481 


ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 


540 






1 M Ml II MM M Mill 1 Ml Ml 1 MM 1 1 1 Ml 1 II II MM Ml MM 1 1 Ml M 




Db 


481 


ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 


540 


Qy 


541 


ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 


600 



IIIIIIMI llllllll III IIIIIIIIIIIIMIIII lllllll III Mill lllllll 

ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 600 



Db 


541 


ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 


600 


Qy 


601 


CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


660 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 
1 M 1 1 1 M 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 II II II 1 II 1 1 1 1 II 1 1 1 1 1 II 1 1 II 1 1 1 II 1 II I 




Db 


601 


CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


660 


Qy 


661 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 


720 






I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

I I 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 




Db 


661 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTrTTr 


/ £t yj 


Qy 


721 


TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 


780 






1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 
1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 




Db 


721 


TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 


780 


Qy 


781 


ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 


840 






1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 II 1 M II 11 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I 1 I I I I I | | | | | | | | | | | | | 




Db 


781 


ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 


840 


Qy 


841 


CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 


900 






1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 M II 1 1 1 1 II II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I 




Db 


841 


CAC C CTAAGGGC ATTGAGATGGATGGGAAAATGATGAATTC C C ATTTTCTAGATGGGAAC 


qoo 

J V \J 


Qy 


901 


CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 


960 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 




Db 


901 


CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC: 




Qy 


961 


AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 


1020 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 II 1 




Db 


961 


AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGPTnnTOnAfiATnrirrAAT 


i o? o 


Qy 


1021 


TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 


1080 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 II 1 1 




Db 


1021 


TACTATGCTCTTTCCCACCAACAGAAGAGCCGTGrrTTrTArrnTATrPAAnrrArTPnT 


i oro 

J. U O u 


Qy 


1081 


ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 


1140 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 
1 N 1 1 1 M II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1081 


ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATnrAnrAGAArAAnrPAAnAAnnrr 


J- X.** \J 


Qy 


1141 


TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 


1200 






1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 

1 M II 1 II II II 1 II II 1 1 i II 1 M M II | M || M 1 II 1 II II 1 1 1 II II II II II II 1 




Db 


1141 


TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGAPTTTATTTrPAAnnTrTTrTTT 


19 00 


Qy 


1201 


GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 


1260 






lllllll lllllll MINIMUM MM llllllllll IIIIIMIIIIIMIMI 




Db 


1201 


GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 


1260 


Qy 


1261 


AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 


1320 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I I 




Db 


1261 


AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 


1320 


Qy 


1321 


AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 


1380 


Db 


1321 


1 1 II 1 II I! 1 1 1 1 1 II 1 III 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 II 1 1 III III 1 1 II 1 1 Ml 

AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 


1380 


Qy 


1381 


CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 


1440 



M MM I MINIMI IIIIIIIMIIMI IIIMIIIMIIII MINIUM Mill 



Db 


1381 


PAPAAPPAGTTPTPPGTGGGPATAATTPATPAPPAPATTTTTPAPPAPPATPAAPAPTTP 

v^i-iVJ/-ir\Ovjrt.vj J. J. x v_\_\_r x uuuv^n x r^n. x x vj.r* x VJrtV_OrtOrt X X X X X OrtOOrtOOrt X unnLnL XIV— 


1 A A O 


Qy 


1441 


TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 


1500 






Mill lllllllll Mill Mill! Mill II Mill Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 




Db 


1441 


TTTGTAAGGTTGAGPAATGTPPPPATAPAPPAPPAPPAPPPAPAPPAPPPPATPPPTPPA 


1DUU 


Qy 


1501 


GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 


1560 






IIIIIIIIMIIIIMIMIIIMIIIIMIIMIIMIIIIIMIIMIIIIIIIMII 




Db 


1501 


GCAATATTPAAPAGTPTTPPPTTPPpTpppp,PTPTPPTAPPPTPPPPTTPTPTPPPPAPA 

X v_rt/-i.v_.rt.VJ XV— X X OV_V_ X X OV_V_ X V— OOOO X O X V— V— IAVjV^L X V— V— V— V— X IvJ ibl OOV_L.rtOrt 


X D D U 


Qy 


1561 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1620 






1 1 Mill 1 II 1 II 1 1 Mill 1 1 1 1 1 III 1 1 II 1 II MM 1 III MM 1 1 Mill 1 II II 1 




Db 


1561 


GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 


1620 


Qy 


1621 


GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 


1680 






MIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIMIMIIIMIIMIIMMIIIIII 




Db 


1621 


PTPAGTPAPAnTATTPPTPTTATPPAPPTP a appttptpppp ap aTPannTnrrfrT'pr'T 

w X X VJ^-1VJ^-1VJ X.tt.X X \J\J X VJ X J. M X UuriUU X OrtrtOO X X V— X OV^V3VJAV.nl V.riUV7 1 VJLLLVJVJVJVJ X 


i (Ton 


Qy 


1681 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


1740 


Db 


1681 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 It M 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 

ACAGTCATCGTCPPPTTTAGGAPAPTAPAAPPPAPAPPPAAPPPTPPppPTPAPPAPTTT 

-ij-V-J x v^^i x v— \J x v*\_. \_. v_« x x X .T4.\J\J.rt^.tt.w 1 nOnnOOOnVvnOV^Vk.ririVjVjVj X OOO OO X Ort.OOrtO XXX 


x / *t u 


Qy 


1741 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 


1800 






III Mill 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II Ml II Ml Mill 




Db 


1741 


PA APAPAPATATPPPP APTTPPA ATTPA AP A ATP A TP a A A PTPTf AAAAPPATA APPPTT 
OrtrtOrtOrtOrt x rt x OOO OrtO X X OO.Hrt, X X v_ rtrtOttrt. X Or\ X OrtAHO X vj X Ortrtrtrtoort. 1 Anvjou 1 1 


1 QArt 

±oUU 


Qy 


1801 


AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 






1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 J 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1801 


AAAATAPTAPATPAPPAPPA ATAPPA A APPPA APAPA ATTTPTTPATTPPPPTTPPTPA A 


loou 


Qy 


1861 


CCGAAATGGATGGAACGTGGAATATC AGATGTGACAGACAGG 


1902 






III lllllllll Mill lllllllll II MM II Ml Mill 




Db 




PPPA A ATPPATPPA APPTPPA AT ATP APPTPTPAPATTPTTT A A APaTPTPRPaP BPaPP 
V- v- Ortrtrt. X OOrt. X VjrOM-rt.V_.Vj X OOrtrt. Irt 1 LnUU X O X OrtOrt 1 1L1 1 1 rtrtrtOrt X O X Ort.0rtort0rt.00 




Qy 


1903 


AAGCTGACTATGGAAGAAGAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTG 


1962 






1 1 1 II 1 1 II II 1 1 1 1 M 1 1 1 1 II 1 II 1 II 1 1 II 1 1 1 1 1 1 1 1 II 1 II II 1 1 II 1 II 1 1 M 1 




Db 


1921 


AAPPTPAPTATPPA APA APAPPAPPPPA APAPPAT APP AP AP ATPPPA A APPPAPTATTP 
rtrto v- x ortL x rt. x uunrtUrtrturtuunuuL V_rtrtOrtOOrt X rtV_jV_.rt.Ort.Vjrt. X oubHHAuL Ln.b 1AJ. lu 


iy o u 


Qy 


1963 


GGTGAACACCCCAAACTAGAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTG 


2022 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 i 1 1 1 1 1 1 [ : 1 1 1 III' II 




Db 


1981 


PPTPA APAPPPPA A APT APA APTPATPATTPA APAPTPPTATP APTTPA aPQPTBPPPTP 
VJVJ X VJ.ttJ-10 rtOO O OrtrtrtO X rt.Vjrtrt.Vj X V—rt. X Vwrt. X X Ortrt.Vjrt.Vj X V_ V_ 1 rt X OrtO X X V_.rtrt.OrtO x rtooo X O 


o n a n 
Z U4 U 


Qy 


2023 


GACAAACTGATCAAGAAGACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGAC 


2082 






1 MMMM III II II III MMIM MIMMII MM II 






Z U 4: ± 


P Zip a a APTP ATP A apaapapaa apptppppttppttptpppp a ppp attpptpp 7\r*r*r* a r* 
bAtAAAL X VjH 1 L AAbAAbAL AAAL L 1 OOOO 1 1 bb X 1 o X bbbbALLLAl 1 \_C_ 1 bbAbbbAL 


2100 


Qy 


2083 


CAGTTCATGGAGGCCATCACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGG 


2142 






Ml II II Mill lllllllll MUM MM II Ml II Ml MM Mill Mill Mill 




Db 




PAPTTP ATPPAPPPP ATPAPPPTP APTPP APPAPPPP ATP APP ATP app atp A Ts.'vnnnr^n 
OrtO X X Ort. X OOrt.OOv_.Ort X Ort-O v_0 X V^rtO X Ov-rtOv_rtOOOOrt X OrtOOrt X OrtOOrt. 1 Ortrt. 1 LLbub 


O 1 C A 

Z lb U 


Ov 


2143 


GAGGAGAGGPTGPPPTPPTPPTTTPAPTAPPTPATPPAPTTPPTPAPTPTPTTPTPPA AP 

wrtVJVJrtVJrtVjvj^ X VJV_V_V_ X OO X OV_ X X X OrtO XrtOO X Ort X OOrtO X X V— • O X OrtO XOXOX XV_X OOrtrtO 


z z u z 






1 1 1 II 1 II 1 III 1 1 1 II MM 1 1 1 1 1 II 1 1 1 II 1 1 MM 1 III MM III III 1 MM II 




Db 


2161 


GAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAG 


2220 


Qy 


2203 


GTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTC 


2262 






1 M 1 II 1 II 1 1 1 II 1 1 1 1 1 1 II 1 1 M 1 1 II 1 M II 1 1 M II 1 1 II M 1 II 1 1 II 1 1 II M 




Db 


2221 


GTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTC 


2280 



Qy 


2263 


TCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGC 


2322 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I 1 1 1 1 1 

1 M M 1 M M M M 1 M 1 M 1 II II 1 1 M 1 11 1 II I 1 II 1 M M M ! M M M II M 1 II 




Db 


2281 


TCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGC 


2340 


Qy 


2323 


TGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCT 


2382 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 
1 M M 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 II I 1 1 1 I 1 M I I I M I | | | | | | 1 




Db 


2341 


TGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTf'T 


2400 

Z. *± \J \J 


Qy 


2383 


GTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCC 


2442 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 II II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II II II II 1 




Db 


2401 


GTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGrrTCP 


£t *± O \J 


Qy 


2443 


ATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGG 


2502 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 II II 1 II II 1 1 1 1 1 II II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 




Db 


2461 


ATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGG 


£t J £, \J 


Qy * 


2503 


TCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACA 


2562 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2521 


TCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTfTArOTnTrnnPmfJrAra 


neon 

Z D O U 


Qy 


2563 


CTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTG 


2622 






1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 II 1 II 
1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


2581 


CTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGPTrTTn 


£* O *± \) 


Qy 


2623 


TACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCC 


2682 






1 1 1 1 M 1 1! 1 II 1 1 II 1 IIMIMI 1 1 Ml 1 1 1 1 1 1 MM 1: 1 1 1 II 1 1 MM 1 II 




Db 


2641 


TACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCC 


2700 


Qy 


2683 


ACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCC 


2742 






MM 1 MM 1 MM 1 Ml 1 II M 1 1 1 MM! 1 1 MM 1 El M 1 M 1 1 1 II II Ml 1 II 




Db 


2701 


ACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCC 


2760 


Qy 


2743 


TATTGCTACATCAAGGGGTTC 2763 




Db 


2761 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 
TATTGCTACATCAAGGGGTTC 2 781 





RESULT 12 
ABX56263 

ID ABX56263 standard; DNA; 2685 BP. 
XX 

AC ABX5 62 63; 
XX 

DT 19-FEB-2003 (first entry) 
XX 

DE Human NOVlc 24 8 057 963 DNA SEQ ID 5. 
XX 

KW NOVX; human; antidiabetic; antiarteriosclerotic ; anorectic; nootropic; 

KW metabolic; antimicrobial; neuroprotective; antiparkinsonian; cardiant; 

KW antilipaemic; cytostatic; immunomodulatory; gene therapy; dyslipidaemia ; 

KW cardiomyopathy; metabolic disorder; diabetes; atherosclerosis; obesity; 

KW anorexia; neurodegenerative disorder; Alzheimer's disease; cancer; gene; 

KW Parkinson's disease; haematopoietic disorder; metabolic disturbance; 

KW metabolic syndrome X; wasting disease; ds . 
XX 

OS Homo sapiens . 



XX 

PN WO200281625-A2. 
XX 

PD 17-OCT-2002. 
XX 

PF 03-APR-2002; 2002WO-US0103 66 . 
XX 

PR 03-APR-2001; 2001US-02 81086P . 

PR 05-APR-2001; 2001US-02 81906P . 

PR 06-APR-2001; 2001US- 0282020P . 

PR 10-APR-2001; 2001US-0282930P. 

PR 12-APR-2001; 2001US- 0283444P . 

PR 12-APR-2001; 2001US-0283512P . 

PR 13-APR-2001; 2001US- 0283657P . 

PR 13-APR-2001; 2001US- 0283678P . 

PR 13-APR-2001; 2001US- 0283710P . 

PR 17-APR-2001; 2001US- 0284234P . 

PR 19-APR-2001; 2001US- 0285325P . 

PR 20-APR-2001; 2001US- 0285381P . 

PR 24-APR-2001; 2001US- 02 86068P . 

PR 25-APR-2001; 2 001US - 02 862 92P . 

PR 07-JUN-2001; 2 001US - 02 96692P . 

PR 26-JUN-2001; 2001US- 0300883P . 

PR 08-AUG-2001; 2 001US - 03 11003P . 

PR 13-AUG-2001; 2 001US- 03 11973P . 

PR 16-AUG-2001; 2001US- 03 12901P . 

PR 14-SEP-2001; 2 001US - 03222 83P . 

PR 05-OCT-2001; 2 001US - 032 744 8P . 

PR 31-DEC-2001; 2001US-0345734P . 

PR 03-JAN-2002; 2002US-0345755P . 

PR 04-FEB-2002; 2002US-0354391P . 

PR 02-APR-2002; 2002US-00114153 . 
XX 

PA (CURA- ) CURAGEN CORP. 
XX 

PI Padigaru M, Shenoy SG, Kekuda R, Rastelli L, Mezes PD; 

PI Smithson G, Guo X, Gerlach V, Casman SJ # Boldog FL, Li L; 

PI Zerhusen BD, Tchernev VT, Gangolli EA, Vernet CAM, Spytek KA; 

PI Malyankar UM, Patturajan M, Miller CE, Taupier RJ, Heyes MP, Ju J; 

PI Peyman JA, Catterton E, Macdougall JR, Edinger SR, Stone DJ; 

PI Mazur A; 

XX 

DR WPI; 2003-046862/04. 

DR P-PSDB; ABU12 04 3 . 
XX 

PT New isolated NOVX polypeptide useful for treating cardiomyopathy, 

PT atherosclerosis, metabolic disorders, diabetes, obesity, infectious 

PT disease, anorexia, neurodegenerative disorders, Alzheimer's disease and 

PT cancer. 

XX 

PS Claim 3; Page 85-86; 425pp; English. 
XX 

CC This invention describes novel polypeptides, termed NOVX which have 

CC antidiabetic, antiarteriosclerotic, anorectic, metabolic, antimicrobial, 

CC neuroprotective, antiparkinsonian, antilipaemic, cytostatic, nootropic, 

CC cardiant and immunomodulatory activity. The polypeptide and any 

CC antibodies generated from it are useful in the manufacture of a 



CC medicament for treating a syndrome associated with a human disease 

CC selected from a pathology associated with the NOVX polypeptide. Fragments 

CC and portions of the polynucleotides encoding NOVX polypeptides are useful 

CC to map the location of NOVX genes on a chromosome, to identify 

CC individuals from minute biological samples, as DNA markers for 

CC restriction fragment length polymorphism (RFLP) , and are useful to 

CC prepare polymerase chain reaction primers. The products of the invention 

CC can be used in gene therapy and for treating cardiomyopathy, metabolic 

CC disorders, diabetes, atherosclerosis, obesity, infectious disease, 

CC anorexia, neurodegenerative disorders, Alzheimer's disease, Parkinson's 

CC disease, immune disorders, haematopoietic disorders, and various 

CC dyslipidaemias, metabolic disturbances associated with obesity, metabolic 

CC syndrome X and wasting disorders associated with chronic diseases and 

CC various cancers. ABX562 61-ABX56306 represent the polynucleotide fragments 

CC which encode the NOVX polypeptides represented in ABU12 041-ABU12 086 

XX 

SQ Sequence 2685 BP; 645 A; 657 C; 741 G; 642 T; 0 U; 0 Other; 



Query Match 96.6%; Score 2673.2; DB 8; Length 2685; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2675; Conservative 0; Mismatches 3; Indels 0; Gaps 0 



Qy 


86 


GAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGCACAGGGCAGAACAATGAGTCCTGTT 


145 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 
II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 II II 1 1 1 1 II 1 1 1 II 1 II II 1 1 1 1 II 1 II 1 1 1 




Pb 


2 


GATCCGAGGCTGGTGGCTCAGGGGACGTGCCAAGCACAGGGCAGAACAATGAGTCCTGTT 


61 


Qy 


14 6 


CAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTGCCAATCTGGTACCCGGAGAACCCTT 


205 






1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I I I I 1 I I I I I I I I 
1 1 1 1 1 1 1 1 M II 1 II 1 II 1 1 II II II 1 II 1 1 II 1 1 I I 1 1 1 II II I I I I 1 M M M | 1 M 




DiD 


62 


CAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTGCCAATCTGGTACCCGGAGAACCCTT 


121 


Qy 


206 


CCCTTGGGGACAAGATTGCCAGGGTCATTGTCTATTTTGTGGCCCTGATATACATGTTCC 


265 






Mill 1 II 1 II II 1 1 MM 1 II 1 1 1 1 Mil 1 1! 1 1 III 1 1 II MINIUM 1 




Db 


122 


CCCTTGGGGACAAGATTGCCAGGGTCATTGTCTATTTTGTGGCCCTGATATACATGTTCC 


181 


Qy 


266 


TTGGGGTGTCCATCATTGCTGACCGCTTCATGGCATCTATTGAAGTCATCACCTCTCAAG 


325 






II Mill MM III 1 MM II II II III Mill Mill III Mill INI III II Mill 




Db 


182 


TTGGGGTGTCCATCATTGCTGACCGCTTCATGGCATCTATTGAAGTCATCACCTCTCAAG 


241 


Qy 


326 


AGAGGGAGGTGACAATTAAGAAACCCAATGGAGAAACCAGCACAACCACTATTCGGGTCT 


385 






1 1 1 1 1 1 1 1 II 1 Ml II 1 1 1 II II 1 1 1 1 1 1 II 1 1 III 1 1 II 1 II 1 1 1 II II I II II || 1 1 1 




Db 


242 


AGAGGGAGGTGACAATTAAGAAAC C C AATGGAGAAAC CAGCACAACCACTATTCGGGTCT 


301 


Qy 


386 


GGAATGAAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCCTCTGCTCCTGAGATAC 


445 






1 1 MM 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II Mill 1 II II 1 1 1 II 1 II 1 1 1 1 1 II 1 MM II II 1 1 1 




Db 


302 


GGAATGAAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCCTCTGCTCCTGAGATAC 


361 


Qy 


446 


TCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGATCTGGGACCTTCTA 


505 






M II 1 II MM III II II II MM II 1 1 Mill Mill III MINIUM IMIIIIII 




Db 


362 


TCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGATCTGGGACCTTCTA 


421 


Qy 


506 


CCATTGTAGGGAGTGCAGCCTTCAACATGTTCATCATCATTGGCATCTGTGTCTACGTGA 


565 






1 1 IMIII 1 1 M M 1 II 1 1 MM Mill Ml II 1 M II 1 MM Ml II 1 M M II Ml II 




Db 


422 


CCATTGTAGGGAGTGCAGCCTTCAACATGTTCATCATCATTGGCATCTGTGTCTACGTGA 


481 


Qy 


566 


TCCCAGACGGAGAGACTCGCAAGATCAAGCATCTACGAGTCTTCTTCATCACCGCTGCTT 


625 






M II 1 II 1 1 1 II 1 M 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M M 1 1 1 




Db 


482 


TCCCAGACGGAGAGACTCGCAAGATCAAACATCTACGAGTCTTCTTCATCACCGCTGCTT 


541 



Qy 


626 


GGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGG 


685 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 l l l l I l l l l I I l l l l l l l l l l l l l I l l l l l l l 

1 II II I 1 II 1 II 1 1 II II 1 1 M Ml 1 II II 1 II 1 II II II II 1 II II 1 II 1 II 1 II II II 




Db 


542 


pp apt atptttppptapatptppptpt at a tp a ttptppp a ptpttptpppptpptptpp 

OOrtO lrtl L X x X OUV. x n\«n X L X uuL X L- JLAlAlbHl 1^,1 oVjLftL* 1L1 1L. X L LL,L 1 UU X VJ X \J\J 


£ 0 1 
DUX 


Qy 


686 


TCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCT 


745 






1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 i 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 

1 M 1 1 1 1 II 1 II 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


602 


TCPAGGTTTGGGAAGGPPTPPTPAPTPTPTTPTTPTTTPPAPTPTPTPTPPTTPTPPPPT 

x \~\~r*i\j\j xxx uuunnuvjuv., i v-v- l V-.ft^, ivi^l X L X X \_ X J. X L LftO lUlVJlulLLl X L X uu^L X 


DDI 


Qy 


746 


GGGTGGCAGATAAACGACTGCTCTTCTACAAATACATGCACAAAAAGTACCGCACAGACA 


805 






1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I I I I I I I I I I I I I I I I I I I 

M II II 1 1 II 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 I I I I I 




Db 


662 


GGGTGOPAPATAAAPPAPTPPTPTTPTAPAAATAPATPPAPA A A A APTAPPPPAPAPAPA 

vjvjvj x VJVJLftVjft x ftftftLVJftL X UL 1 V- ± 1 v 1 ftLftftft X ftLft X vJLftLftftftftftVj X ftLLVJLftLftVJftLft 


TOT 


Qy 


806 


AACACCGAGGAATTATCATAGAGACAGAGGGTGACCACCCTAAGGGCATTGAGATGGATG 


865 






1 1 i 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l l l l l l l I l I l I I I I i I I i i i i i i 

M 1 M 1 1 1 1 II 1 II 1 II 1 1 1 1 II 1 II 1 1 II 1 II 1 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


722 


A AP A PPPt APP A ATT ATP AT AP AP A P AP APPPTPAPP A PPPT A APPPPATTPA PS TPP A TP 

firt\-riV^\^WrtvJ\Jrirl. X X ft X Lft X ftOftOftLftvJftVjVJvJ X VjftLLftL L L X ftftvjVjvjLft X X Oftwft X X O 


/ O 1 


Qy 


866 


GGAAAATGATGAATTCCCATTTTCTAGATGGGAACCTGGTGCCCCTGGAAGGGAAGGAAG 


925 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l l l l l l l l 

II 1 II 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 




Db 


782 


PPA A A ATPATPA ATTPPPATTTTPTAPATPPPA APPTPPTPPPPPTPP A APPP A APPS 
OVJftftftft x Vjft x VJftft x IL^V^iil 1 1 1L IHuHluuunnv^L lUUiuLLLL IuunnUUUiy\uunAu 




Qy 


926 


TGGATGAGTCCCGCAGAGAGATGATCCGGATTCTCAAGGATCTGAAGCAAAAACACCCAG 


985 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 I 1 1 I I l l l l l l l l l l l l I i I I I I I i I I i I i i i i t i 

1 M M 1 1 1 MM MM II 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 II 




Db 


842 


TPPATPAPTPPPPP APAPAP ATPATPPPPATTPTP A APPATPTP AAPPAAAA APAPPr 1 Af* 1 
x Uun x \Jr\\J x L, L. L Vj L, ft Oft Vjft vjft 1 Lxft X LLVjoft X 1 L 1 LAnVJUn 1 1* 1 unAuLAfiAAALAL LLAvj 


y u i 


Qy 


986 


AGAAGGACTTAGATCAGCTGGTGGAGATGGCCAATTACTATGCTCTTTCCCACCAACAGA 


1045 






i M 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 > 1 I 1 1 1 1 
1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


909 
.7 \J 


APA APPAPTTAP ATP APPTPPTPPAPATPPPP A A TTflPTBTPPTPTTTPPfA fnA 7\nj\ria 
ftVJftftvjLxftL. X JlHuH 1 LrtuL 1 uu J. uoHun 1 ouLL.nH 1 1 AL 1 A luL i L 1 1 1 LLLAllAAlAvzA 


y b i 


Qy 


1046 


AGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGTATGATGACTGGTGCAGGCAATATCC 


1105 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 1 I I It I I I I I I I I I I I I I I I I I I I I I I I I i i i I i 

1 M M M M II 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I | | | | | | | 




Db 


962 


APAPPPPPPPPTTPTAPPPTATPPA APPP APTPPTATPATPAPTPPTPP APPP A ATATPP 
r\\Jr\\j v^^U^UV^U XXV, Xft.LL.vJ Irii LLftnVJLV.RU X LL7 X ft X Vjft X VJftL X X oLHvjVjL Aft 1 ft 1 LL 


lUzl 


Qy 


1106 


TGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCCTCCAGCATGAGCGAGGTGCACACCG 


1165 






1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 I I I I 1 I I I I I I I I I I I I I I I I I I l I I 1 i l I i i i I i I i I i i i i i i 
1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


i nop 


TP A APA A APATPPAPP APA ap a aPPPB apa apppptppappatv^ app^ Ar , r ,r rr , r , A r'APr'n 

1 UftrtUftftrtLAl VjLftoLftVjftftLftHuLLftftOftftUOLL 1 LLrtuLft 1 urtuLuAub 1 uLftLALLb 


luol 


Qy 


1166 


ATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTTGACCCATGTTCTTACCAGTGCCTGG 


1225 






i 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 ) 1 1 1 1 l l l l l l l l l l l l l l I l I > I 

M M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I 1 I I I I I I I I I I || 1 I | | | | | | | | | 




Db 


1 0R9 


ATPAPPPTPAPPAPTTTATTTPP A APPTPTTPTTTP j\PPP A r Pr ,r rTT""P r PA r*r*T\ PTPPPTT'P 
ft i oauLL 1 uHuoHL X X XftX X llLHHooltl ILl 1 IbftLLLftiul 1L1 lALLftoloLLluvj 


T T A T 
1141 


Qy 


1226 


AGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGGAAAGGGGGAGACATGTCAAAGACCA 


1285 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 l l l l l l l I l l l l I I I I I I I I I I I i i I i i i i i i i i i i i i i i i i i i 

1 M 1 M 1 1 1 M II 1 II II 1 II II II II II II II II II 1 1 II II II II II II II 1 II II II 




XJXJ 




ftvjftftL 1 \j x owuuL I o I Al llLl ljAlAvj 1 1 UAuuAAAljljljLjljAvjAlA i \j i lAAAvjAllA 


1201 


Qy 


1286 


TGTATGTGGACTACAAAACAGAGGATGGTTCTGCCAATGCAGGGGCTGACTATGAGTTCA 


1345 






I I I I I i I i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i I i i i i i i i i i i i i i i i i i > i 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 I I I I 1 1 | | | I | | | 1 1 | 1 1 1 1 1 | | || | 1 1 






19 09 


TPTBTPTPPfiPTAPA AAA P7\P7\ r*r~* A TPPfPTPTPPPT^ A IT* /*i A PPPOPTT" A nmA TT* A PmmP7\ 

xL7 xAxLtIvjlxAl 1 Al AAAAl AUALjLjA 1 LAj 1 1 l 1 bLLMi OlAljljvjCjl ICjAl 1 ATUAGTTCA 


1261 


Qy 


1346 


CAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACCCAGAAGGAGTTCTCCGTGGGCATAA 


1405 






M 1 1 M M M M M M M 1 M M M M M M M M 1 M M M M 1 M 1 M M M M M M 




Db 


1262 


CAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACCCAGAAGGAGTTCTCCGTGGGCATAA 


1321 


Qy 


1406 


TTGATGACGACATTTTTGAGGAGGATGAACACTTCTTTGTAAGGTTGAGCAATGTCCGCA 


1465 






M M M 1 M M 1 M M M M M M M M 1 M M M M 1 M 1 M 1 M 1 M 1 1 M M 1 M M 




Db 


1322 


TTGATGACGACATTTTTGAGGAGGATGAACACTTCTTTGTAAGGTTGAGCAATGTCCGCA 


1381 



Qy 


1466 


TAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCAGCAATATTCAACAGTCTTCCCTTGC 


1525 






1 1 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


i ;p') 
1 jo/ 


tapt\pp7\ pn 7\ppAr | ppAr , Ar , P a r*r*r*r* a tppptpp7\ pp a a t< a •"ptp a a p a p | T'P'T'n">ppp'T'»T'pp 
1 AGAGGAGGAGGAGGGAGAGGAGGGGA 1 GGG 1 LLAbCMTA 1 1 GAAGAG 1 G 1 1 GLGTTGG 


T A A T 
1441 


Qy 


1526 


CTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGATGATGACCATG 


1585 






1 1 1 1 1 1 1 1 1 II 1 1 Ml 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Dh 




U 1 ILj Iv^U lAbLL lt-Cv„G 1 1G1G 1GGGGAGAG 1 1 AGGA1 G 1 1GGA1GA1GAGGA1G 


IbO 1 


Qy 


1586 


CAGGCATCTTCACTTTTGAATGTGATACTATTCATGTCAGTGAGAGTATTGGTGTTATGG 


1645 






1 1 MINI 1 Ml 1 1 1 1 1 1 1 IMIMI 1 1 1 1 1 II 1 Mlllll III II II 1 MM Mill 1 1 




uu 




nTiPPPATPTTPA PTTTTP A A TPTP A T A PT 1 A TTP A TPTP7V PTP A nAPTn M" MPPfTPTifnTv rp/-i/">« 

GAGGGA1G 1 1LAL ill 1GAA1G1GA1 AG 1A1 1 GA1G 1 GAG 1GAGAG 1 Al 1GGTGTTATGG 


1561 


Qy 


1646 


AGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGTACAGTCATCGTCCCCTTTAGGACAG 


1705 






II M 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 M II 1 1 II 1 1 1 1 II 1 1 M M 1 1 1 1 II 1 1 1 1 1 M M 




nH 
uu 




HdnTCh. Z\ nm l T"T l P 1P PP , P'P'P' A P ATPAPPTPPPPPPPPTA P A PTP A mpPTPPPPTTT 1 ^ ppn 

Aljta 1 UAAvjLj 1 ILlbLboALAlLAljblbLLLbbbblALAblLAlLblLLLL 1 1 1AGGALAG 


162 1 


Qy 


1706 


TAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTTGAAGACACATATGGGGAGTTGGAAT 


1765 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




nH 
uu 


1 £9 9 


T A Pi A A nr~in A PAPPPA appp tp r~* r^r^r^n^r** appa ptttp a apapapata Ttr^rir^r* 7\ /"•ttpp tv 7\ t 
1 AbHAuuoALAVjLLAAbbb 1 GGGGG 1 GAGGAG 111 GAAGAG AG A 1A rGGGGAGTTGGAAT 


1681 


Qy 


1766 


TCAAGAATGATGaAACTGTGaaAACCATAAGGGTTAAAATAGTAGATGAGGAGGAATACG 


1825 






II 1 II III II 1 1 1 1 II II II 1 IMIIMI MM II II M M II 1 II II 1 MM M Ml II 




uu 


xo oz 


T'P* A SPA A T'P 1 A TP* AAA PTP TP A A A A PP A T A A pppmmi\ tv tv ATAPTAP7V TP 7\ r+r~* ~T\ r*l\ tv T7V /~i/"t 

1 v~ AALjAA 1 GA 1 bAAAL 1 G 1 bAAAALLA 1 AAGGG 1 1 AAAATAGTAGATGAGGAGGAATACG 


1741 


Qy 


1826 


aAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAACCGAAATGGATGGAACGTGGAATAT 


1885 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 






1 *7 A 9 
1 1 Z 


AA APPP A A P AP A A TTTPTTP A r PTPPPPT"T'PP'T'P "A APPHTi 7\ 7\ np/""»/-i 7\ m/in 7\ tv m/~l /""l 7\ ivmnm 

AAAbbb AAbAbAA 1 1 1G1 1GA1 1GGGG1 1GG IGAAGGGAAATGGATGGAACGTGGaAT AT 


1801 


Qy 


1886 


CAGATGTGACAGACAGGAAGCTGACTATGGAAGAAGAGGAGGCCAAGAGGATAGCAGAGA 


1945 






1 1 MM II III I II Ml MM Illlllllll II III 1 MM 1 lllllllll II II Mill 




nH 
uu 


1 o UZ 


PAPA T'P T'P APAPAPAPPA A P PTP A PTA TPP A APA 7V.P7VPP7\PPPP7V 7V P 7V PP TV T 7V P ^* 71 /~i tv /~i 7\ 

GAGA 1 G 1 GAG AG AG AGGAAGG 1 GAG 1 A 1 GGAAGAAGAGGAGGGGAAGAGGATAGCAGAGA 


1861 


Qy 


1946 


TGGGaAAGCCAGTATTGGGTGAACACCCCAAACTAGAAGTCATCATTGAAGAGTCCTATG 


2005 






Ml III M 1 III II II MM 1 1 1 II Ml III II Mlllll 1 Mill Ml II M II II 1 1 1 




uu 


1 Q £ O 
lOOZ 


TPPP A A APPPAPTA TTP P P TP AAPAPPPPAA APnPAPA7\ PTP 7\ rpp A TT^« 7\ 7\ /"I 7\ ^1 T/"*i^iT 7\ rn/"l 

1 GGGAAAGGG AG 1 A 1 1 GGG 1 GAAGAGGGGAAAGTAGAAGTCATCATTGAAGAGTCCTATG 


1921 


Qy 


2006 


AGTTCAAGACTACGGTGGACaAACTGATCAAGAAGACAAACCTGGCCTTGGTTGTGGGGA 


2065 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




nh 


1 Q9 9 


A PTTPA APA PTA C^C^C^TVC^C* A PA A A PTP A T'P AAPAAPAPAAA PPTPPPPmTPPmTipi'PPPnPa 

ALj 1 1 L.AAVjAL. 1 AL-LjG 1 GGAGAAAG 1 GA 1 GAAGAAGAGAAAGG 1 GGGG 1 1 GGTTGTGGGGA 


1981 


Qy 


2066 


CCCATTCCTGGAGGGACCAGTTCATGGAGGCCATCACCGTCAGTGCAGCAGGGGATGAGG 


2125 






M 1 II 1 1 1 M 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




nH 
uu 


J. 2 O Z 


PPP A TTPPTP P A PP*P 7V PP 7V PTiTP 7\ rri/ m t/ m % 7\ /T /~i /"i tv m/"i 7\ /"i/*"i/"Trn/i 7\ /"i m/""! /™i tv r*\ /"i tv /"i /"n"*t tv m/~i tv 

GGLA1 1GG1GGAGGGAGGAG1 1GA1GGAGGGGATCACCGTCAGTGGAGCAGGGGATGAGG 


2041 


Qy 


2126 


ATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCC 


2185 






1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




nH 
uu 




ATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCC 


2101 


Qy 


2186 


TGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCT 


2245 






1 II Mill 1 II 1 M II MIMI 1 II IMIIMI MM 1 II II 1 1 II III II II II 1 III 1 




nH 


9 1 no 
z x u z 


T'P A PTPTPTTPTPP A APPTPPTP TT TPP PTPTPTPPPPPPP A P A P A PT* A PTPPP7\ P/""/"i/"iT 

IVjAL. ILjIL-1 1 1 LiVjAAGG 1 GG 1 G 1 1 1GGG 1G 1G 1GGGGGGGAGAGAG 1 AG 1 GGCACGGCT 


2161 


Qy 


2246 


GGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACC 


2305 






MMMMMMMMMMMMMMMMMMMMMMIMMMMMMMI 




Db 


2162 


GGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACC 


2221 


Qy 


2306 


TGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCG 


2365 



Illllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

3GCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCG 2281 



Db 


2222 


TGGCCTCGCACTTCGGCTGCACCATTGGTClTPAAAnATTPAnTr i ArAOrTfiTTnTTTTr , n 


99R1 

4b O jL 


Qy 


2366 


TGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATG 


2425 






I 1 1 1 1 1 1 1 I I i 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 

II II M M 1 1 M II 1 II II II II 1 II II 1 II II II 1 II 1 II II II 1 1 1 II II II II II 1 1 




Db 


2282 


TGGCATTTGGCACCTCTGTCCCAGATACGTTTGrrARrAAARrTGrTnrrrTPr'AnGATn 

vj v^ v^^i j. j. j. \rnw a w j. w i vv v*- **.vjxi a vj xxx u vrmnvjv x vj v_. x vJ V-« v_» ui\> v~xi.vj vjxi x VJ 




Qy 


2426 


TATATGCAGACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGG 


2485 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 

II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 II II 1 1 1 II II M 1 M M 1 1 | 




Db 


2342 


TATATGCAGACGCCTCCATTGGCAACGTGACGGGC , AnrAAr , nPPRTrAATnTrTTr , rTr:n 


94. HI 
Z ft U J. 


Qy 


2486 


GCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCC 


2545 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 

1 1 1 1 II 1 I 1 1 II 1 1 1 1 1 1 II 1 II II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 M 




Db 


2402 


GCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTrr 

w v»* a a. \- vvj ^— ' v»> J- vj vj v» v» .1. vj vj J- v* v— VJ J. VJVJUV-VJV.. \ — ±l X w X XT. V— X VJ VJ VJ V— ■ X V* X VJ N^<iVjSJwiiV^/^.VjVJrt\J X X V, V— 


^ t D 1 


Qy 


2546 


ACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCT 


2605 






II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M II 1 1 II 1 1 II M 1 1 II II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I | | 




Db 


2462 


ACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACCCTCTTCACCATf'TTTnrATTTflTrT 

*^*^»>-» a v. ' a vwv v#- vj vj v~x^ v»*^Vj» j- vv vj \_» w x x v— x x v«nv« v* v-» x V-. x x v«>£^v-» v_».r^ x v» XXX w v<fl XXX vJ X ^> X 


9 591 


Qy 


2606 


GCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCC 


2665 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 I l l I i l I I I I I i i 

1 II 1 II II 1 1 1 M 1 M II II II II 1 1 1 II II II II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 




Db 


2522 


GCATCAGCGTGCTPTTGTArrfiAAnnPfinPPftPAPPTn^ 

VJ v.rt 1 v.rtv7 v.VJ 1 \Jv» 1 v. 1 X VJ lrtv.V-unftOVJv.v7v7v.v.Ov.rtv«v. A vJvjOrtOvJvvTvji-ivJv-. 1 L vJw 1 VjTvJ v-- V-. v-» v- v- 


OCQ1 

Z o o 1 


Qy 


2666 


GTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCT 


2725 






i II 1 1 1 1 II 1 1 1! 1 1 1 III 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 ! 1 1 1 ! 1 II 1 1 1 1 1 1 1 1 1 




Db 


2582 


GTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCT 


2641 


Qy 


2726 


TTGCCACACTAGAGGCCTATTGCTACATCAAGGGGTTC 2 763 








lllllll Ml II IIIIIIMI Mill IIIIIIIMIII 




Db 


2642 


TTGCCACACTAGAGGCCTATTGCTACATCAAGGGGTTC 2 679 
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PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, cardiomyopathy, 

PT atherosclerosis or diabetes, in chromosome mapping, tissue typing or in 

PT pharmacogenomics . 

XX 

PS Claim 20; SEQ ID NO 861; 1503pp; English. 
XX 

CC The invention relates to 566 new isolated human polypeptides and their 

CC encoding genes, sequences that are at least 95% identical to these or 

CC sequences comprising one or more conservative substitutions in these. The 

CC polypeptide, polynucleotide and antibodies against the polypeptides are 

CC useful in diagnosing, treating or preventing NOVX-associated disorders, 

CC e.g. cardiomyopathy, atherosclerosis, hypertension, cancer, obesity, 

CC diabetes, AIDS, multiple sclerosis, graf t-versus-host disease, 

CC Alzheimer's disease, Parkinson's disease, asthma, or fertility disorders. 

CC The nucleic acids are. further used as hybridization probes, in chromosome 

CC mapping, tissue typing, preventive medicine, and pharmacogenomics. The 

CC polypeptides are also useful as vaccines. This sequence represents an 

CC example of the nucleic acid sequence of the invention. 

XX 

SQ Sequence 2685 BP; 645 A; 657 C; 741 G; 642 T; 0 U; 0 Other; 

Query Match 96.6%; Score 2673.2; DB 12; Length 2685; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2675; Conservative 0; Mismatches 3; Indels 0; Gaps 0 
Qy 86 GAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGCACAGGGCAGAACAATGAGTCCTGTT 145 



Db 



2 




61 



QY 



146 



CAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTGCCAATCTGGTACCCGGAGAACCCTT 



205 



Db 



62 




121 



Qy 



206 



CCCTTGGGGACAAGATTGCCAGGGTCATTGTCTATTTTGTGGCCCTGATATACATGTTCC 



265 



Db 



122 




181 



Qy 


266 


TTGGGGTGTCCATCATTGCTGACCGCTTCATGGCATCTATTGAAGTCATCACCTCTCAAG 


325 






MM! Ml MM MM! IIIIIIIIMIIMMMIIIIIIIMIIIIIIIIi 




Db 


182 


ttpppptptppatpattpptpappppttpatpppatptattpa APTPaTPappTPTPaap 


£ *± X 


Qy 


326 


AGAGGGAGGTGACAATTAAGAAACCCAATGGAGAAACCAGCACAACCACTATTCGGGTCT 


385 






1 IMIIIIII Mill IMIIMIIIIMII 1 MM 1 MM 1 III Mill Mill! MM 




Db 


242 


an annn a pptp a p z\ a tt a & p a & a ppp i\ a thh a p a i\ a p p a p papa appap r ra r P r ppppp r PP r p 

rtOrtUUUrtUU X LjM.V-.rt/* X X HrtuHHHLLLHH X VjLjMLjAAALLALjLALAALLAL lAi 1 LOOO 1 L 1 


•j n 1 
0U1 


Qy 


386 


GGAATGAAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCCTCTGCTCCTGAGATAC 


445 






IIIIIIIIIIIIIMIIMIIIMIIMIIIIMIIMIIIIIIIIIIIIIIIIIIIIII 




Db 


302 


PP A ATP A A APTPTPTPPA a PPTP a PPPTT atppppptpppttpp'pptpp'pppt'p bp a t 1 a p 

uvjnft x lUl^l v_V_.rti-\.V-L- X VJi-lLL.\_» X XM.XVjVjL.LL X UOU X X LL 1 L 1 uL X LL X uHunlnv- 




Qy 


446 


TCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGATCTGGGACCTTCTA 


505 






1 IMIMMII MMMMMMMM Mill IMIIIIII IIIMMIM lllllllll 




Db 


J O £t 


T P P T PT P TT T A A T TP A PPTPTP TPPT P a T^HflTT P A TTP PTfi P Tf a T 1 P p p a p PTTPT'a 
x v^u x l x l X X XMM 1 x urtuu lulul X L,Hiuub X X LM 1 1 VjL X ouloHlt 1 uuufttL 11C 1 A 


/ion 
4^1 


Qy 


506 


CCATTGTAGGGAGTGCAGCCTTCAACATGTTCATCATCATTGGCATCTGTGTCTACGTGA 


565 






III Ml Mill Ml Ml MIMIII MM Ml MM III MM 




Db 


422 


PPATTPTAPPPAPTPPAPPPTTPA APATPTTPATP ATPaTTPPPATPTP r rP r PP r PaPP r rPa 

^— wrt. X X vjj XrtVJVJVJrtVJ X VJV_MVJV- V- X X LrtrtLrt lull Lrt X Lrt X Lrt X X VJvjLMX LXLjXLjXLX AL VJ X IjA 


*± O 1 


Qy 


566 


TCCCAGACGGAGAGACTCGCAAGATCAAGCATCTACGAGTCTTCTTCATCACCGCTGCTT 


625 






1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 t 1 I 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 




Db 


482 


TPPPAPAPPPAPAPAPTPPPA APATP A A APATPTA PPAPTPTTPTT'PaT'P a PPPP'TPPT'T 1 
x \^U\^rt.vjft^uui-iurturvL X LVjLMrtVjrt X LHrtrtLH 1 L 1 HLunu 1L1 1L1 1 LrtlLALLbL 1 VjL 1 1 


i)4 1 


Qy 


626 


GGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGG 


685 






1 1 i 1 1 1 1 1 1 1 1 1 1 1 I E 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 II MM; III MM 




Db 


542 


PPAPT ATPTTTPPPT A P ATPTPPPTPT ATA TPaTTPTPPP a P r PP r rT , P r PPPPP*T , PP r rP r rPP 
vjvjrtvj x rt x l x X xvjll irt^HiU XVjVjL X L IHlHluHl 1 L 1 uuLnu 1 L 1 1L1 LLLL ibulo lou 


bUl 


Qy 


686 


TCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCT 


745 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E i 1 1 J 




Db 


O V ^ 


TPP APPTTTPPP A APPPPTPPTP APTPTPTTPT l T'PT n T , T , PPaPT , PTV^T n PT , PPT'T , PT^PPPP r P 
x LLMLjLjX X X uuortHuoL-L ILL 1LHL ILlLHLllLil ILLALjILj lululLL 1 1L ILjVjLL 1 


DO 1 


Qy 


746 


GGGTGGCAGATAAACGACTGCTCTTCTACAAATACATGCACAAAAAGTACCGCACAGACA 


805 






II MIM III IIIMMIM III MM II! MIMIMI MM III 




Db 


662 


PPPTPPPAP ATA A APP APTPPTPTTPTAPa a a'papaTPPa pa a a a aPT»a pppp a pap a p a 
vjvjvj x uuLrturt x rtrtMLVjrtL X 1L1 1 L 1 AL AAA X ALA 1 bLnLAAAftAb 1 ALLbLALAbALA 




Qy 


806 


AACACCGAGGAATTATCATAGAGACAGAGGGTGACCACCCTAAGGGCATTGAGATGGATG 


865 






'MM Ml MM II! Ill III MM MM II! MM MMMI 




Db 


7?? 


A AP APPPAPPA ATT ATP AT APapaPAP APPPTPa ppapppT^a appppa r r r rp apa'pppannp 
rtrtLrtLLortVjVjrtrtX lAi LAlAbHbALAbHbbb IbALLALLL 1 AAbbbLAl IbAbAlbbAlb 


/O 1 


Qy 


866 


GGAAAATGATGAATTCCCATTTTCTAGATGGGAACCTGGTGCCCCTGGAAGGGAAGGAAG 


925 






II INI II III MM 1 MUM MM 1 MM 1 MIMIMI MMMI III MM 1 1 III 




Db 




ppa a a a TP. a TP. a bttpppb r P r r r r r rp r ra p aTPPP a a pptpptppppptpp a a ppp a app 7\ 7v p 

bbnftArt 1 Li A 1 LjAA 1 1LLLA1 1 1 1L X AbAlbbbAALL ibblbLLLL IbbAAbbbAAboAAb 


841 


Qy 


926 


TGGATGAGTCCCGCAGAGAGATGATCCGGATTCTCAAGGATCTGAAGCAAaAACACCCAG 


985 






1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 




Db 


o *± ^ 


TPP ATP aptppppp AP APapaTPaTPPPPa r r r rp r rpa appan^pTP a app a a a a apapppap 

X Vj vjM X VjrtL* X L L LLjL ALjALjALjA 1 LjA 1 L LLjLjA 1 1L1 LAAbbA 1 L 1 bAAbLAAAAALAL L L Avj 


901 


Qy 


986 


AGAAGGACTTAGATCAGCTGGTGGAGATGGCCAATTACTATGCTCTTTCCCACCAACAGA 


1045 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 M 1 1 1 1 




Db 


902 


APAAPPAPTTAPATPAPPTPPTPPAPATPPPPA ATTAPTATPPTPTTTPPPaPPa apapa 
.«.vj.rvr*\JVJ/-i\»» x x n.\Jf\ x v^.j-i.vav- X uU X VJVJrt\Jrt X <—.MM X X ML Inl 1L111 LLLnL LMrtL AVjA 


y 0 1 


Qy 


1046 


AGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGTATGATGACTGGTGCAGGCAATATCC 


1105 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


962 


AGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGTATGATGACTGGTGCAGGCAATATCC 


1021 


Qy 


1106 


TGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCCTCCAGCATGAGCGAGGTGCACACCG 


1165 



Illlllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

TGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCCTCCAGCATGAGCGAGGTGCACACCG 1081 



Db 


1022 


TGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCCTCCAGCATGAGCGAGGTGCACACCG 


1081 


Qy 


1166 


ATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTTGACCCATGTTCTTACCAGTGCCTGG 


1225 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 II M 1 1 II 1 1 1 II 1 II 1 1 1 II 1 1 II II 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 II II 1 1 1 1 1 II III 




Db 


1082 


ATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTTGACCCATGTTCTTACCAGTGCCTGG 


1141 

J. _L *± J. 


Qy 


1226 


AGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGGAAAGGGGGAGACATGTCAAAGACCA 


1285 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 M II 1 I M II 1 1 II II 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 




Db 


1142 


AGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGGAAAGGGGGAGACATGTCAAAGACCA 


1201 


Qy 


1286 


TGTATGTGGACTACAAAACAGAGGATGGTTCTGCCAATGCAGGGGCTGACTATGAGTTCA 


1345 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 
1 1 1 1 M 1 II II II 1 II II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 I I I I 1 1 1 1 1 1 I I | | | | | 1 | | | | | | | | 




Db 


1202 


TGTATGTGGACTACAAAACAGAGGATGGTTCTGCCAATGCAGGGGCTGACTATGAGTTCA 


1261 


Qy 


1346 


CAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACCCAGAAGGAGTTCTCCGTGGGCATAA 


1405 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 II 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1262 


CAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACCCAGAAGGAGTTCTCCGTGGGCATAA 


1321 


Qy 


1406 


TTGATGACGACATTTTTGAGGAGGATGAACACTTCTTTGTAAGGTTGAGCAATGTCCGCA 


1465 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1322 


TTGATGACGACATTTTTGAGGAGGATGAACACTTCTTTGTAAGGTTGAGCAATGTCCGCA 


1381 


Qy 


1466 


TAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCAGCAATATTCAACAGTCTTCCCTTGC 


1525 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1382 


TAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCAGCAATATTCAACAGTCTTCCCTTGC 


144 1 


Qy 


1526 


CTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGATGATGACCATG 


1585 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1442 


CTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGATRATnAPPATn 


1 Rfil 


Qy 


1586 


CAGGCATCTTCACTTTTGAATGTGATACTATTCATGTCAGTGAGAGTATTGGTGTTATGG 


1645 






I 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II II M II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


1502 


CAGGCATCTTCACTTTTGAATGTGATACTATTCATGTCAGTGAGAGTATTGGTRTTATnn 

w ^— '*- i x x x vi iv x x x x x \j x vjii X X *a. X X vxl X \J X ^n\J X wnwriw X X X \J\J X \J X X xi. X VjVj 


X J O _L 


Qy 


1646 


AGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGTACAGTCATCGTCCCCTTTAGGACAG 


1705 






I 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 

II 1 1 II 1 II II 1 II II II II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II I I 




Db 


1562 


AGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGTACAGTCATGGTGGGGTTTAGGArAG 


1 69 1 


Qy 


1706 


TAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTTGAAGACACATATGGGGAGTTGGAAT 


1765 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 




Db 


1622 


TAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTTGAAGACAGATATGGGGAGTTGGAAT 


J- D O x. 


Qy 


1766 


TCAAGAATGATGAAACTGTGAAAACCATAAGGGTTAAAATAGTAGATGAGGAGGAATACG 


1825 






1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 II II 1 1 1 II 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I 1 I I I I M 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1682 


TC AAGAATGATGAAACTGTGAAAAC CATAAGGGTTAAAAT AGT AG ATG AGG AGG A AT A GG 


1 741 


Qy 


1826 


AAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAACCGAAATGGATGGAACGTGGAATAT 


1885 






1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 
1 1 1 M 1 1 M 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 II 1 II 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1742 


AAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAACCGAAATGGATGGAACGTGGAATAT 


1801 


Qy 


1886 


CAGATGTGACAGACAGGAAGCTGACTATGGAAGAAGAGGAGGCCAAGAGGATAGCAGAGA 


1945 






MINI II II II 1 1 1 1 1 1 III II 1 1 1 1 1 II 1 1 1 ! 1 1 1 1 1 1 I 1 1 1 M 1 1 1 II II 1 II 1 1 1 1 




Db 


1802 


CAGATGTGACAGACAGGAAGCTGACTATGGAAGAAGAGGAGGCCAAGAGGATAGCAGAGA 


1861 


Qy 


1946 


TGGGAAAGCCAGTATTGGGTGAACACCCCAAACTAGAAGTCATCATTGAAGAGTCCTATG 


2005 



Ml Mill MM MINIMI Mill Mill llllllllllllll Mill llllllllll 



Db 


1862 


TPPPA a apppaptattppptpaapappppa. a aptapa aptpahppat'T'p a araPTPPTaTr 


± y z i 


Qy 


2006 


AGTTCAAGACTACGGTGGACAAACTGATCAAGAAGACAAACCTGGCCTTGGTTGTGGGGA 


2065 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


1922 


ZXPTTP A. APAPT 1 APPPTPP AP A A SPT/I^TPa RPA APAPA A7\ PPTPPPPTTPPTTPTrTTT" A 

X X Lrtrturtt l /iLiaL* 1 L*L?rt.L>\M/*L 1 urt i LHRbHHbALAAALL 1 IjLtLL 1 1 vjvj 1 Ibl bbobA 


TOOT 


Qy 


2066 


CCCATTCCTGGAGGGACCAGTTCATGGAGGCCATCACCGTCAGTGCAGCAGGGGATGAGG 


2125 


Db 




II M IT II ' II III IMIHII MM III MIMM II 




1982 


rfTATTPrTnnjifipjiArpiiP^TpaTPPAppppATPAPppTPAPTppAppAPPPpATPAPn 

Vw-V^v^-M. 1 ibLl L^AL'oL'/iL^H.Vjl ± LA1L*LtAL*uLLAI lALLVj 1 LALt I L*LAUlA(jLtovjA IuACjG 


O Ail 1 

2041 


Qy 


2126 


ATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCC 


2185 






MlllllMIII III MM IM II Mil, 1! IIIIIIIIMMIMII 




Db 


2 042 


A TP Z\ fin A TP 2X A T P PPPP P & PP A P A np, p TP p p pt p p tp p tttp a p t 1 a pp t 1 p a tp pa pt*t»pp 
1 o.rt.vjOM. i o/Aurt. i LLuUuortuuHu/iuuL 1 L*LLL 1 1 bL 1 1 ILrAL 1 ALL* 1 LAI L*LAL 1 1 L-L 


2101 


Qy 


2186 


TGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCT 


2245 






III 1 Mill INI INI 1 llllllll 1 Ml Mill 1 II 1 1 1 III 1 MM MM llllll 




Db 


£t X \J 4* 


TPAPTPTPTTPTPPA ZinP,TP,PTP,TTTPPPTPTPTPPPPPPPBPAPAPT 1 A r^rrnnr* a rr«r«r"r» 
HjHbiuiL 1 1 LlbbMAbblbL lull loUL lo 1 L* 1 LLLLLLLAlALtAL? 1 AL xoLLAlCjvjL 1 


2161 


Qy 


2246 


GGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACC 


2305 






1 1 II II II 1 II 1 III II II 1 1 1 1 MM 1 1 1 MINI 1 1 Ml III 1 III 1 MM II Mill 




Db 


2162 


GGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACC 


2221 


Qy 


2306 


TGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCG 


2365 






II M 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 II M 1 1 1 1 




Db 


2222 


TPPPPTPPP APTTPPPPTPP A PP ATTPPT'P'T'P A A APATTPAPTPAPAPPTPTTPTTTTPP 


22 ol 


Qy 


2366 


TGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATG 


2425 


Db 


O 


1 M 1 II II h 1 1 1 1 1 1 II .III II 1 IM 1 1 II 1 1 III 1 1 II M 1 III 1 Mill Ml 1 

TfJnrATTTnnPAPPTrTnTPPPaPZiTAPPTTTPPPAPPA A APPTPPTPPPPTPPAPP A T"P 

ibbbHi 1 ibbbALL Iblbl LLLAbAlALb 1 1 i bbb AbbAAAbL 1 bL 1 bLLL 1 bbAbbA I b 


2341 


Qy 


2426 


TATATGCAGACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGG 


2485 


Db 




1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




? 14? 


TATZ\TPPAPZ\PPPPTPP2\TTPPPA APPTPAPPPPPAPPA A ppppptpa a tp'"pp«t»t>pp'tipp 
liiiHibLHbHLuLL 1 LLA1 IbbLAAbblbALbbbLAbLAALbLLo 1 LAAlbl L IxLLTGG 


2401 


Qy 


2486 


GCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCC 


2545 


Db 




M 1 1 II MM 1 1 II 1 1 Mill III Mlllllll 1 MM 1 III 1 MM 1 III Mill 




4 *± w ^ 


fir & TPnnPPTPnPPTPPTPPPTPPPPPPP A TPT a ptpppptptp p a pph a nAPP7\ PTTinn 
ULA 1 Lbbbb 1 bbbb 1 LLr 1 Ltb 1 L*L»LLL*LLA iLl At 1 IjL*<jL 1 L 1 LLAl*LGAlAGGAGTTlC 


2461 


Qy 


2546 


ACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCT 


2605 


Db 




III 1 Mill II 1 1 1 II 1 II INI 1 1 II II 1 IM 1 M 1 1 1 M 1 1 1 Ml 1 IMIIMI 1 




94 fi9 

^■T Q^ 


Zi PPTPTPPPPPPP PAPA PTPP PPTTPTPPPTP A PPPTPTTP 7\ PP7V rnomrnfprtrtTi rnrnrnnmnm 
Hbb 1 b 1 bbbbbbbb AbAL IbbLL 1 Ibl LLL* 1 LAbLL J. L 1 1 lAllAxlTTTGlATTTGTlT 


2521 


Qy 


2606 


GCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCC 


2665 






II 1 III II III MIMSIMI MM 1 IMIIIIM IMIIMI llllll III Mill 1 ! 




nh 




L»L A 1 LAbLb 1 LjL 1L11 Kj i ALLvaAAUbLOOLLGCALCTGGGAGGGGAGCTTGGTGGCCCCC 


2581 




Z D D D 


PT'PPPT'^IP A Hr^r^TT^r^ PP A PA A PA TPPPTPTTTPTT' a PPPT»prnpP Pn"</"i/"'«rTi/"irp7V ri t\ ititv ^mom 
L* 1 L»L*L 1 oLAAoL x LbLLALAALA 1 LjVjL I L i 1 1 Lj I bjAtjLL 1 Cj 1 L*GL T CCTCTACATACTCT 


2725 






MMM III Mlllllll MM II II llllllll Mlllllll IMIIMI III II 




Db 


2582 


GTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCT 


2641 


Qy 


2726 


TTGCCACACTAGAGGCCTATTGCTACATCAAGGGGTTC 2763 








II M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 III 1 1 1 1 1 1 




Db 


2642 


TTGCCACACTAGAGGCCTATTGCTACATCAAGGGGTTC 2 679 





RESULT 14 
ADH42312 



ID ADH42312 standard; DNA; 2685 BP. 
XX 

AC ADH42312; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Novel human nucleic acid NOV55f . 
XX 

KW ds; gene; cardiovascular; antiarteriosclerotic ; hypotensive; cytostatic; 

KW anorectic; antidiabetic; immunosuppressive; anti-HIV; neuroprotective; 

KW nootropic; antiparkinsonian; antiasthmatic; antiinf ertility ; 

KW cardiomyopathy; atherosclerosis; hypertension; cancer; obesity; diabetes 

KW AIDS; multiple sclerosis; graf t-versus-host disease; Alzheimer's disease 

KW Parkinson's disease; asthma; fertility disorder; chromosome mapping; 

KW tissue typing; preventive medicine; pharmacogenomic; vaccine. 
XX 

OS Homo sapiens . 
XX 

PN WO2003102159-A2 . 
XX 

PD ll-DEC-2003. 
XX 

PF 04-JUN-2003; 2 003WO-US017573 . 
XX 

PR 04-JUN-2002; 2 002US - 03 854 90P . 

PR 04-JUN-2002; 2 002US - 03 85615P . 

PR 04-JUN-2002; 2 002US - 03 85755P . 

PR 05-JUN-2002; 2002US -0386041P . 

PR 06-JUN-2002; 2002US -0386355P . 

PR 06-JUN-2002; 2 002US-03 86357P . 

PR 06-JUN-2002; 2 002US-03 86447P . 

PR 06-JUN-2002; 2002US-0386459P. 

PR 06-JUN-2002; 2002US-0386465P. 

PR 06-JUN-2002; 2002US-03 86864P . 

PR 07-JUN-2002; 2002US-03 86701P . 

PR 07-JUN-2002; 2002US -03 86796P . 

PR 07-JUN-2002; 2002US -03 86931P . 

PR 07-JUN-2002; 2002US-0387078P. 

PR 07-JUN-2002; 2002US -03 87081P . 

PR 07-JUN-2002; 2002US -03 87083P . 

PR 10-JUN-2002; 2002US -03 87429P . 

PR 10-JUN-2002; 2 002US - 03 8754 OP . 

PR 10-JUN-2002; 2002US-0387866P. 

PR ll-JUN-2002; 2002US-03 87606P . 

PR ll-JUN-2002; 2002US -03 87610P . 

PR ll-JUN-2002; 2002US-0387659P . 

PR ll-JUN-2002; 2 002US - 03 87668P . 

PR ll-JUN-2002; 2 002US - 03 87696P . 

PR ll-JUN-2002; 2 002US - 03 87859P . 

PR 12-JUN-2002; 2 002US - 03 87934P . 

PR 12-JUN-2002; 2 002US - 03 87 960P . 

PR 12-JUN-2002; 2002US-0388022P . 

PR 12-JUN-2002; 2002US-03 88096P . 

PR 12-JUN-2002; 2 002US - 03 88432P . 

PR 12-JUN-2002; 2 002US - 03 88479P . 

PR 13-JUN-2002; 2 002US - 03 89123P . 

PR 14-JUN-2002; 2 002US - 03 89120P . 



PR 14-JUN-2002; 2 002US - 03 89146P . 

PR 17-JUN-2002; 2002US-0389742P . 

PR 18-JUN-2002; 2002US-0389604P . 

PR 18-JUN-2002; 2002US-03 89884P . 

PR 19-JUN-2002; 2002US-0390006P . 

PR 19-JUN-2002; 2002US-03 90144P . 

PR 19-JUN-2002; 2002US-0390209P . 

PR 25-JUN-2002; 2002US-0391726P . 

PR 06-AUG-2002; 2002US-0401628P . 

PR 09-AUG-2002; 2002US-0402268P . 

PR 12-AUG-2002; 2002US-0402822P . 

PR 13-AUG-2002; 2002US-0403458P . 

PR 15-AUG-2002; 2002US - 0403617P . 

PR 15-AUG-2002; 2002US -04 03732P . 

PR 26-AUG-2002; 2002US -0406182P . 

PR 12-SEP-2002; 2002US-04 10085P . 

PR 13-SEP-2002; 2002US- 04 10505P . 

PR 23-SEP-2002; 2 0 02US - 04 12 955P . 

PR 30-SEP-2002; 2 002US - 04 15195P . 

PR 23-OCT-2002; 2 002US - 042 0627P . 

PR 23-OCT-2002; 2002US-0420718P . 

PR 24-OCT-2002; 2002US- 0420852P . 

PR 31-OCT-2002; 2002US- 0422750P . 

PR 01-NOV-2002; 2002US-0423095P . 

PR 05-NOV-2002; 2002US-0423748P . 
XX 

PA (CURA-) CURAGEN CORP. 

XX 

PI Al sob rook JP, Anderson DW, Baumgartner JC, Berghs C, Boldog FL; 

PI Burgess CE, Casman SJ, Catterton E, Dhanabal M, Edinger SR; 

PI Ellerman K, Ettenberg S, Gangolli EA, Gerlach VL, Gorman L; 

PI Grosse WM, Gunther E, Guo X, Gusev VY, Herrmann JL, Ji W, Kekuda R; 

PI Khramtsov NV, Larochelle WJ, Li L, Liang H, Low K, Macdougall JR; 

PI Maclachlan T, Malyankar UM, Mcqueeney K, Mezick AJ, Miller CE; 

PI Millet I, Padigaru M, Patturajan M, Peyman JA, Qian X, Rastelli L; 

PI Rieger DK, Rothenberg ME, Shenoy SG, Shimkets RA, Smithson G; 

PI Spytek KA, Stone DJ, Sukumaran S, Szekeres ES, Vernet CAM, Voss EZ; 

PI Wolenc AR, Zhong M, Zhong H; 

XX 

DR WPI; 2004-053467/05. 

DR P-PSDB; ADH42313. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, cardiomyopathy, 

PT atherosclerosis or diabetes, in chromosome mapping, tissue typing or in 

PT pharmacogenomics . 

XX 

PS Claim 20/ SEQ ID NO 865; 1503pp; English. 
XX 

CC The invention relates to 566 new isolated human polypeptides and their 

CC encoding genes, sequences that are at least 95% identical to these or 

CC sequences comprising one or more conservative substitutions in these. The 

CC polypeptide, polynucleotide and antibodies against the polypeptides are 

CC useful in diagnosing, treating or preventing NOVX-associated disorders, 

CC e.g. cardiomyopathy, atherosclerosis, hypertension, cancer, obesity, 

CC diabetes, AIDS, multiple sclerosis, graf t-versus-host disease, 

CC Alzheimer's disease, Parkinson's disease, asthma, or fertility disorders. 



CC The nucleic acids are further used as hybridization probes, in chromosome 

CC mapping, tissue typing, preventive medicine, and pharmacogenomics . The 

CC polypeptides are also useful as vaccines. This sequence represents an 

CC example of the nucleic acid sequence of the invention. 
XX 

SQ Sequence 2685 BP; 645 A; 657 C; 741 G; 642 T; 0 U; 0 Other; 



Query Match 96.6%; Score 2673.2; DB 12; Length 2685; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2 675; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 



Qy 


86 


GAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGPACAGGGPAGAAPAATOAGTPPTPTT 


X *± ZJ 






II 1 1 1 1 1 1 1 1 ! 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 




Db 


2 


GATCCGAGGCTGGTGGCTCAGGGGACGTGCCAAGCACAGGGCAGAACAATGAGTCCTGTT 


61 


Ov 


146 


CAGGGTCATCGGACTGCAAGGAGGGTGTPATPPTrirrAATPTnrJT2VPPPOrian2X2iPPPTT 


one: 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 III 1 Ml 1 1 1 




Db 


62 


CAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTGCCAATCTGGTACCCGGAGAACCCTT 


121 


Ov 


206 


CCCTTGGGGACAAGATTGCCAGGGTPATTnTPTATTTTfJTnPPPPTPATATAPATPTTPP 


9 £ c; 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


122 


CCCTTGGGGACAAGATTGCCAGGGTCATTGTCTATTTTGTGGCCCTGATATACATGTTCC 


181 


Ov 
vy 


266 


TTfifiOnTOTPP ATP ATTPPTf? APPPPTTP ATnPPL\TPTZiTTPZ\ anTnaTfarrTPTPB z\n 


7 0 1= 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


182 


TTGGGGTGTCCATCATTGCTGACCGCTTCATGGCATCTATTGAAGTCATCACCTCTCAAG 


241 


Ov 
vy 


3 2 6 




JO J 






1 II II 1 1 III 1 MM 1 1 II 1 1 T II MM 1 1 II 1 1 1 1 1 !! ' M 1 Mi 1 1 1 1 1 II 




Db 


242 


AGAGGGAGGTGACAATTAAGAAACCCAATGGAGAAACCAGCACAACCACTATTCGGGTCT 


301 


Ov 
vy 


3 86 


GGAATGAAAPTGTPTCPAAPPTnAPPPTTATPfiPPPTPPPTTPPTPTPPTPPTPl\PI\Tl\P 


A A R 
ft *t D 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 




Db 


302 


GGAATGAAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCCTCTGCTCCTGAGATAC 


361 


Ov 
vy 


446 


x LL x -L-ri-rt. X ivjriuolulul x Lftlobo 1 X Ln. 1 x bL lvjvj luAiL 1 vjovjAll 1 ILin 


c n c 






1 II 1 1 1 1 II 1 1 II 1 ! 1 1 1 1 1 Ml 1 1 1 1 1 III II 1 1 1 1 II 1 1 1 II 1 1 1 IMIII 1 1 III 1 1 




Db 


362 


TCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGATCTGGGACCTTCTA 


421 


Qy 


506 


CCATTGTAGGGAGTGCAGCCTTCAACATGTTCATCATCATTGGCATCTGTGTCTACGTGA 


565 






1 III II III Mill 1 MM 1 III 1 IMMIIII IIIIMIII 1 IIIIIMII Mill 1 II 




Db 


422 


CCATTGTAGGGAGTGCAGCCTTCAACATGTTCATCATCATTGGCATCTGTGTCTACGTGA 


481 


Qy 


566 


TCCCAGACGGAGAGACTCGCAAGATCAAGCATCTACGAGTCTTCTTCATCACCGCTGCTT 


625 






III ; 1 1 1 M 1 : III IIIIIIIIIIIIIIIIIIIIIMIIIIIIII 




Db 


482 


TCCCAGACGGAGAGACTCGCAAGATCAAACATCTACGAGTCTTCTTCATCACCGCTGCTT 


541 


Qy 


626 


GGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGG 


685 






M II 1 II II M II II 1 1 1 1 1 1 II II 1 M 1 II II 1 1 1 II 1 1 1 1 II 1 M 1 II 1 1 II 1 1 1 II 1 




Db 


542 


GGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGG 


601 


Qy 


686 


TCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCT 


745 






II 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 II 1 M 1 II 1 II 1 M II 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 II M 1 1 1 




Db 


602 


TCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCT 


661 


Qy 


746 


GGGTGGCAGATAAACGACTGCTCTTCTACAAATACATGCACAAAAAGTACCGCACAGACA 


805 



MM Ml MINIMUM III MIMM llllilll MlhMMM 



Db 


66? 


ppptpppapataaappaptpptpttptapaa atapatppapa a a a aptappppapapapa 

XoVjV_rtOrt.Xrtrtrt.\_Vjrt.O Ivl J. /iV*nHH.lHLfi 1 oV_rtV_rtrtrt\rtrto lrt.V_V_VjV_rt.V_/lort.V_rt. 


791 
/ Z 1 


Qy 


806 


AACACCGAGGAATTATCATAGAGACAGAGGGTGACCACCCTAAGGGCATTGAGATGGATG 


865 






Mill 1! II 1 II 1 1 1 M 1 1 MM II Mill 1 1 II II II 1 1 III 1 II 1 II 1 MINIM 




Db 


722 


A APAPPPAPPA A TT ATP AT AP AP A PAPAPPPTP A PP A PPPT A APPPP A TTna PA TPP A TP 
nnLnL OOrtOOrtrt. 1 Inl Ort. X rtOrtOrtOrtOrtOOO X UnL V_rt.V_ V_L. 1 rtrt.VjVjVjV_rt. X X Onun X OOrt 1 Vj 


/ O 1 


Qy 


866 


GGAAAATGATGAATTCCCATTTTCTAGATGGGAACCTGGTGCCCCTGGAAGGGAAGGAAG 


925 






1 1 1 II II M II II 1 II 1 1 1 II 1 1 1 II M 1 II M 1 1 II II 1 1 II II 1 II II 1 II 1 II 1 M 1 




Db 


782 


GG AAAA TGATGAATTPPPATTTTPTAPATPPP A APPTPPTPPPPPTPP A APPP A APP A AP 

\j\jr^nr^n. x. \jtr^ x wrvri j_ _l v_ \_ ^.rt. x X X X V* X / iAJ.rt. X OOOrtJ-l\_ V— X OO X Vj V_ V_- V_- X OOrviOOvJrlnoOrtfiu 


Oft 1 


Qy 


926 


TGGATGAGTCCCGCAGAGAGATGATCCGGATTCTCAAGGATCTGAAGCAAAAACACCCAG 


985 






1 M III 1 1 III 1 II 1 1 1 II 1 1 1 1 II 1 1 1 1 llllll 1! Ill 1 1 II 1 II 1 1 1 1 III III 1 1 1 




Db 


842 


TGGATGAGTCCCGCAGAGAGATGATCCGGATTCTCAAGGATCTGAAGCAAAAACACCCAG 


901 


Qy 


986 


AGAAGGACTTAGATCAGCTGGTGGAGATGGCCAATTACTATGCTCTTTCCCACCAACAGA 


1045 






MMM III MIIMIMM Mill MIMIIIMMMIMM Mill Mill MM 




Db 


902 


APA APPAPTTAPATPAPPTPPTPP APATPPPPA ATTAPTZiTPPTPTTTPPPAPPA Apar'a 
rtOrtrtO OrtO x x fturt X V-AuL X Oo X oortort. X UuULnn X 1 rt.V_ X rt. X VjV_ 1L1 1 1 LLLHtUAnLnbn 


JDl 


Qy 


1046 


AGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGTATGATGACTGGTGCAGGCAATATCC 


1105 


Db 


962 


IIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIII 

APAPPPPPPPPTTPTAPPPTATPPA APPPAPTPPTATPATPAPTPPTPPAPPPA. QTATPP 
rtVJrtOV_OOV_OV_0 11V- X .rt.v_.v_o Xrt. X V_v_rtrt.VjV_V_rt.v_ x wlnlunlunL X OO X OV_rt.VjOV_.rtrt. X rt. X V_ V_ 


i no i 

lUZl 


Qy 


1106 


TGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCCTCCAGCATGAGCGAGGTGCACACCG 


1165 






II 1 MM III II III 1 1 MM 1 MM IIIIIIMI Mill 1 MM Mill Mill MM 




Db 


J- V/ Z. 


TP A apa A apatppapp apa ap a apppa anaannppTnpappaTPnpppappTPPapBppp 

X unnOrtnrtLH X VjOiAoV_rtOrt«rtOrtrt.oV_V_rtrt.VjrtrtOO 1 V_V_rtoV_rt. 1 vjrt.vjV_ort.vjo 1 0 L-rt.V_rt.V_ V_ o 


1 Pi Q T 


Qy 


1166 


ATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTTGACCCATGTTCTTACCAGTGCCTGG 


1225 






II 1 MM III II 1 II II MM 1 MM Mill MM 1 MM 1 MM IMIIIMM MM 




Db 


1 Oft? 


ATPAPPPTPAPPAPTTTATTTPP A APPTPTTPTTTPAPPP A TPTTPTT A PP APTPPPTPP 
rt.XOrt.OV_V_ X UnuUnL X X X rt X X X oV_rtrt.VjVj XV_XXv_XXX Ort.V_v_V_rt.Xo 1 X V_ X 1 rt.V-V_rt.Vj 1 VjV_V_ loo 


1 14 1 


Qy 


1226 


AGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGGAAAGGGGGAGACATGTCAAAGACCA 


1285 






Mill Mill Mill MMMMIMMIMM IIIIIIIIIIIMIMIMIIIMIII 




Db 


1142 


AGAAPTPTPPPPPTPTAPTPPTPAPAPTPPTPAPPAAAPPPPPAPAPATPTPA AAPAPPA 

rt.VJrtrt.x_ 1VJ1 OOOOV_ X O X rt.V_ X OV_ X OrtV_rtO X OO X OrtOOrtrtrtOVjOOOrtOrtOrt X O X Ortrtrt.Vjrt.V_ V_rt, 


1 1 n i 

1ZU1 


Qy 


1286 


TGTATGTGGACTACAAAACAGAGGATGGTTCTGCCAATGCAGGGGCTGACTATGAGTTCA 


1345 






1 MIIMIMM II MIIIIMMMIMM MIIMMIMM MMM MMIIMM 




Db 


1 9 09 


TPTATPTPPAPTAP A A QAPZiPaPPaTnPTTPTPPPQaTPPaPPPPPTPaPTaTPBPTTPa 
XOXrtXOXOOi^V_Xrt.V_rtrtrtrt.V_rt.VjrtOVJrt.lVjol 1 V_ loV_V_rtrt,l OV,rt.oVjVjVjV_ loAV_ lAloAol lV_A 


1 oci 


Qy 


1346 


CAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACCCAGAAGGAGTTCTCCGTGGGCATAA 


1405 






II II Mill MM 1 MM 1 MM Mill INI lllllllll 1 MM 1 II II 1 lllllll 




Db 


1 9 fi9 


PttfiZinpnP2VPPPTPP,TTPTPaapppappapapapppapaappaPTTP f rppp'pppppA r r7\ a 
v_rt.ort.ooov_rt,v_oo x Vjo x x v_ x ortrt.ov_ v_rt.vjort.ort,ort.v- v_ v_rt.ortrt.ovjrt.o 1 1L1 v_v_o 1 uuuLA x AA 


ljzl 


Qy 


1406 


TTGATGACGACATTTTTGAGGAGGATGAACACTTCTTTGTAAGGTTGAGCAATGTCCGCA 


1465 






IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIII 




Db 


1 "*99 

1 J Z z. 


TTPATPAPPAPATTT r P''r , PAP-P-APPA'T l P AApa PTTPTTTPTA T\ppmmp tihptvtv rppmop/inT* 

i 1 ort. X Vjrt.v_Ort.v_rt. 1 ill lV_»rt.ooKoort.lVjrtrt.v_rt.v_ 1 1L1 1 IVjIAAooI 1 bAbLM 1 o 1 v_v_oV_A 


13 81 


Qy 


1466 


TAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCAGCAATATTCAACAGTCTTCCCTTGC 


1525 






MM MM IMMM MM Mill IMMM III III IIMMIII II 




Db 


1 7 ft 9 
1 J o z 


Tapappappappapppap apparr i PP7\ r pr , pr ir pr i r , APr a ata http 7\ APAPTnTTPPPTTpp 
l rt Vjrt Vjorto orto v_ rt.Vj o v_ rt.Vjrt.Vjvj A VjoLjVj A 1 Vj v_ v_ 1 v_ L, Avj v_ AA 1 A 1 1 v_ AAv_ Avj 1 v_ I 1 v_ v_ v_ 1 IvjL 


1 A A 1 

1441 




I o 


PTPPPPPTPTPPTaPPPTPPPPTTPTPTPPPPS PAPTTA PPATfTTPP A TP A TP A PP ATP 
V-. 1 V_VjVjVjO XVjX V_V_ Xrt.VjV-.V_ X V_V_V_V_ X X Vj X Vj looV_V_rt.V_rt.O 1 1 rt.V_V_rt. 1 v_ i looAloAloAV«V_Al Vj 


1 c o c 






1 1 1 II 1 1 1 II M 1 1 M 1 1 1 1 1 III 1 1 1 II MM II 1 MM 1 1 M INI II II 1 II MM 1 




Db 


1442 


CTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGATGATGACCATG 


1501 


Qy 


1586 


CAGGCATCTTCACTTTTGAATGTGATACTATTCATGTCAGTGAGAGTATTGGTGTTATGG 


1645 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 II M 1 1 1 1 1 M 1 II M M II 1 M 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


1502 


CAGGCATCTTCACTTTTGAATGTGATACTATTCATGTCAGTGAGAGTATTGGTGTTATGG 


1561 



Qy 


1646 


AGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGTACAGTCATCGTCCCCTTTAGGACAG 


1705 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 II 1 1 II II 1 II II 1 1 1 II II II 1 1 1 1 1 1 1 1 1 1 1 I II M 1 1 II 1 II 1 1 1 1 1 1 1 1 1 




Db 


1562 


AGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGTACAGTCATCGTCCCCTTTAGGACAG 


1621 


Qy 


1706 


TAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTTGAAGACACATATGGGGAGTTGGAAT 


1765 






1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


1622 


TAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTTGAAGACACATATGGGGAGTTGGAAT 


1681 


Qy 


1766 


TCAAGAATGATGAAACTGTGAAAACCATAAGGGTTAAAATAGTAGATGAGGAGGAATACG 


1825 






1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1682 


TCAAGAATGATGAAACTGTGAAAACCATAAGGGTTAAAATAGTAGATGAGGAGGAATACG 


1741 


Qy 


1826 


AAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAACCGAAATGGATGGAACGTGGAATAT 


1885 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 
M II II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 II 1 1 1 1 




Db 


1742 


AAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAACCGAAATGGATGGAArnTnClAATAT 


1 ft 01 


Qy 


1886 


CAGATGTGACAGACAGGAAGCTGACTATGGAAGAAGAGGAGGCCAAGAGGATAGCAGAGA 


1945 






I 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II 1 1 II I II II II 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 




Db 


1802 


CAGATGTGACAGACAGGAAGCTGACTATGGAAGAAGAGGAGGCCAAGAGGATAGCAGAGA 


X O O X 


Qy 


1946 


TGGGAAAGCCAGTATTGGGTGAACACCCCAAACTAGAAGTCATCATTGAAGAGTCCTATG 


2005 






II 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 
1 M 1 M 1 M II II 1 1 1 II II II II II 1 1 II II 1 || I I I I I I || I 1 I I | | | | | | | | | | | | | 




Db 


1862 


TGGGAAAGCCAGTATTGGGTGAACACCCCAAACTAGAAGTCATCATTGAAGAGTrPTATn 


1 9? 1 

X _7 & JL 


Qy 


2006 


AGTTCAAGACTACGGTGGACAAACTGATCAAGAAGACAAACCTGGCCTTGGTTGTGGGGA 


2065 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 
1 1 M 1 1 M M II 1 II II 1 II II 1 II 1 II 1 II II II 1 1 II II 1 II II II M 1 II 1 || 1 I M 




Db 


1922 


AGTTCAAGACTACGGTGGACAAACTGATCAAGAAGArAAArrTnnrrTTnnTTriTnnnnA 

- tr *- N - J x x viinvjnv- x nuuvj x vjvj.rt.v_rtrtrt.v- x vjn. x v_.rtirtvJrtJrtvJrt.v_rtJrtrt.v_.v_ X VJOV>v< X X VJVJ X x VJ X VjvJvjvjrt. 




Qy 


2066 


CCCATTCCTGGAGGGACCAGTTCATGGAGGCCATCACCGTCAGTGCAGCAGGGGATGAGG 


2125 






MMII llllll Mill [Mill III Mill II IIIMIIIIIMIIMI Mill MM 




Db 


1982 


CCCATTCCTGGAGGGACCAGTTCATGGAGGCCATCACCGTCAGTGCAGCAGGGGATGAGG 


2041 


Qy 


2126 


ATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCC 


2185 






1 1 1 1 1 I I I I I I I I I l l l l l l l l l l l I I I 1 I 1 I 1 I I I II i I i i i M ii i i i i i i i i i i i i i 
I I I I I ll l l l l l 1 1 1 1 1 II 11 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 II 1 1 1 1 




Db 


2042 


ATGAGGATGAATCCGGGGAGGAGAGGCTGCCCTCCTGCTTTGArTArGTrATGPArTTrr 


£t JL \J X 


Qy 


2186 


TGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCT 
1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 


2245 


Db 


2102 


II 1 1 II 1 1 1 M 1 1 1 1 II 1 1 1 It 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 
TGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTGrrrrrPAPAflAnTAr , Tnr , rAr , r;nrT 

X VJi^lv* J- X \_ J. X v— J. X V_J V_ _L VJ X X X VJ V_ V_ X VJ X VJ X VJV_ V_ V_ V_ V_ V_rt\_..rtVJ.rtVJ X rt V_ X U*w Lri^wuv X 


Z X O X 


Qy 


2246 


GGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACC 


2305 


Db 




I 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 

II I 1 1 If 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I | | | | | | | | M | | | | | | | | | | | | | | | 




2162 


GGGCCTGCTTCGCCGTCTCCATCCTCATCATTGnPATGPTrArrnPPATrATTnnr;riZVPP 

uvjwv< x x j-v-v_jv_\_\jxv_x v^v^n x v_ v_ x v_rt x v_rt. X X VJVjV_rt. X VJ v_ X V_.rt.v_. v_ Vj v_ v_rt X v_rt. X X OwvJvj.rt.v_. V_ 




Qy 


2306 


TGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCG 


2365 






II 1 1 1 1 1 1 1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 l l 

N II 1 II 1 II 1 1 II 1 II 1 II II II I M M M M 1 II 1 1 II II II 1 II II 1 II II II 




Db 


2222 


TGGCCTCGCACTTPGGPTGPAPPATTGGTPTPAAAf^ATTPAnTPaPanPTnTTnTTTTPr' 

x UUV.V. x v-vjv_.t-i.v_ x x v_vjvjv_ x VJV_rtV_v_rt X x VJVJ iv,l v_.rtrtrt.vjrt. ± X v_rtvJ X v_rt.v_rtvJv_ X vj X X vJ X X X X v_o 


z z o 1 


Qy 


2366 


TGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATG 


2425 






1 1 1 I 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 | | | | | | | | || | | | | | M 1 1 1 




Db 


2282 


TGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATG 


2341 


Qy 


2426 


TATATGCAGACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGG 


2485 






1 1 1 1 1 1 1 1 1 MM 1 1 III 1 1 1 1! 1 1 Ml Ml INI 1 1 1 II 1 1 1 1 1 II III III 1 1 III 1 1 




Db 


2342 


TATATGCAGACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGG 


2401 



Qy 


2486 


GCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCC 


2545 






MMI M IMIMMMI III IMMMIII III IMIMIII MUM 




Db 


94 fi 9 


^r^R^^^/^^^T^r^riO^T^^riT^^^^T^ririr^^ri^^AT^/^T^ >^irp/^rp/-i nT\npp jv/itv /-"i/i tv OT»"P/'i/"i 

ol^AlL.<jvaL.L. ILiLjUIw lbultHjlbbLLuULAlt 1 A(_ IvjCjvjL 1 L IvjLAvjoCjACACjCjAoTTCC 


2461 


Qy 


2546 


ACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCT 


2605 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


94 69 

^ *x O Z, 




zbzl 


Qy 


2606 


GCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCC 


2665 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


9 R9 9 




2581 


Ov 
vy 


Z D D O 


oJ.oov_ ibLAAbL lHjLLALAALAKjbL 1L11 iu 1 IjAVjUI~ ILx ILjvjC ILL 1L 1ACA1AL IL 1 


o o o c 

2725 






Ml Ml Mill III MM MM MM MM MIM^MII Ml Mill 




Db 


2582 


GTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCT 


2641 


Qy 


2726 


TTGCCACACTAGAGGCCTATTGCTACATCAAGGGGTTC 2763 








1 MM III Mill II III MIMMMMMM Mill 




Db 


2642 


TTGCCACACTAGAGGCCTATTGCTACATCAAGGGGTTC 2679 
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PI Malyankar UM, Patturajan M, Miller CE # Taupier RJ, Heyes MP, Ju J; 

PI Peyman JA, Catterton E, Macdougall JR, Edinger SR, Stone DJ; 

PI Mazur A; 

XX 

DR WPI; 2003-046862/04. 

DR P-PSDB; ABU12042. 

XX 

PT New isolated NOVX polypeptide useful for treating cardiomyopathy, 

PT atherosclerosis, metabolic disorders, diabetes, obesity, infectious 

PT disease, anorexia, neurodegenerative disorders, Alzheimer's disease and 

PT cancer. 

XX 

PS Claim 3; Page 84; 425pp; English. 
XX 

CC This invention describes novel polypeptides, termed NOVX which have 

CC antidiabetic, antiarteriosclerotic , anorectic, metabolic, antimicrobial, 

CC neuroprotective, antiparkinsonian, antilipaemic , cytostatic, nootropic, 

CC cardiant and immunomodulatory activity. The polypeptide and any 

CC antibodies generated from it are useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease 

CC selected from a pathology associated with the NOVX polypeptide. Fragments 

CC and portions of the polynucleotides encoding NOVX polypeptides are useful 

CC to map the location of NOVX genes on a chromosome, to identify 

CC individuals from minute biological samples, as DNA markers for 

CC restriction fragment length polymorphism (RFLP) , and are useful to 

CC prepare polymerase chain reaction primers. The products of the invention 

CC can be used in gene therapy and for treating cardiomyopathy, metabolic 

CC disorders, diabetes, atherosclerosis, obesity, infectious disease, 

CC anorexia, neurodegenerative disorders, Alzheimer's disease, Parkinson's 

CC disease, immune disorders, haematopoietic disorders, and various 

CC dyslipidaemias, metabolic disturbances associated with obesity, metabolic 

CC syndrome X and wasting disorders associated with chronic diseases and 

CC various cancers. ABX56261-ABX56306 represent the polynucleotide fragments 

CC which encode the NOVX polypeptides represented in ABU12 041-ABU12 086 

XX 

SQ Sequence 2840 BP; 668 A; 700 C; 775 G; 697 T; 0 U; 0 Other; 



Query Match 96.1%; Score 2657.6; DB 8; Length 2840; 

Best Local Similarity 97.6%; Pred. No. 0; 

Matches 2712; Conservative 0; Mismatches 54; Indels 12; Gaps 1; 



Qy 


i 


ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 


60 






1 1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 
1 1 M M M 1 II M i II 1 II II 1 II II 1 II 1 1 1 II II 1 1 1 1 II 1 II II II II 1 II II II 




Db 


63 


ATGGCGTGGTTAAGGTTGCAGCCTCTCAPPTPTGPPTTPPTPPATTTTPPnPTPPTTAPP 

J- VJVJ v»\j x vjvj J. X fW\J\J J. X VJ V— .fT.v«J 1 1 >— V-. -L v_. J- \J\_» v_, X X V- ^» X v_» V_,.r-i X X X X VJVJVJv X VJvJ X X v_ 


1 99 


Qy 


61 


TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I i 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 




Db 


123 


TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGrsnArnTnPPAAnr' 


1 R9 
x o z 


Qy 


121 


ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTG 


180 






I I i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 i ii i i 1 1 ii i 1 1 ii 1 1 1 1 1 i i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 

I I I 1 1 1 1 1 ii 1 1 I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 




Db 


183 


ACAGGGCAGAACAATGAGTCCTGTTCAGGGTCATCGGACTGPAAGGAGGPTPTPATPPTP 


949 
z *± z, 


Qy 


181 


CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 II 1 1 II 1 II II II II II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 




Db 


243 


CCAATCTGGTAPPPGGAGAACCCTTPPPTTGGGGAPAAPATTfiPPAfinnTPATTPTPTAT 


^09 


Qy 


241 


TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 


300 






II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I I || I || I I I I I | I M | | | | | | | | | 1 M | | | M M 




Db 


303 


TTTGTGGCCCTGATATACATPTTPPTTGGPPTfiTPPATPATTPPTPAPPPPTTPATPnPA 

1 1 J-vji vjvj \_ \ — \_ x x n. x nv.n iui x vJVJVJVJ x vj x v^vrll X X uw X OrivvuL X X v_/A X vjvj V r\ 


^ £9 


Qy 


301 


TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II II I II I I I II || | | || | || | | | | | || | M | | | | M M | 




Db 


363 


TCTATTGAAGTPATPAPPTPTP A APAP APPPAPPTPAP A ATT A AP A A APPP A ATPP AH A A 

a v« x x x vjriravj x v~ri x w v_ x v_ X wnnvjriUAUuVJrVVJO X vjfiwfiri X X nn\Jr\f\r\ \* \^ Lnn X VJVj/-iVJr-ir\ 


A 9 9 
r± Z Z 


Qy 


361 


ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 


420 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 II II 




Db 


423 


ACCAGCAPAAPPAPTATTPPPPTPTPPAATPA A APTPTPTPPA APPTPAPPPTTATPPPP 


*± OZ 


Qy 


421 


CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 
1 1 1 1 1 1 1 1 II 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 II II 1 1 II 1 1 1 




Db 


483 


CTGGGTTPPTPTPPTPPTPAPATAPTPPTPTPTTTA ATTPAPPTPTPTPPTPATPPPTTP 

V- X VJVJVJ X X V, v_ x w X v_jv_ X X v_ v_ X w X X X Innl X Vj.rt.VJVJ 1VJ IVjI VJVJ X V_rt X OkjVj X X lw 


3 ft Z 


Qy 


481 


ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 
1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | | | 




Db 


543 


ATTPPTPPTP ATPTPPP APPTTPT A PP A TTPT 'APPP APTPP A PPPTTPA & Pa TPTTPBTP 
■"X iVJUl VJVJ X VJrtX v_ X VJVJVjrt.\~V-. lit Xrt.V_V_.rt X X VJ X rtVJVJVJrt.VJ X VJV_rtVjv_V_ X X v#HHwHl VJ X X v_rt X v_ 


o Uz 


Qy 


541 


ATCATTGGCATCTGTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 


600 






l l l I I I I I I I i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i 
1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 II 1 1 1 1 II 1 1 1 1 




Db 


603 


ATPATTPPPATPTPTPTPTAPPTPATPPPAPAPPPaPftPAPTPPPaaPATHA nrrnTPTa 
ivjuwrtiv- X vj X vj X v_ irtv.v7l vjrtX v. v. v^HuriL ovj Avj/iuHv, XV_vjv_.rtrtvj.rtX v.AftvjLnlL 1A 


O OZ 


Qy 


601 


CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


660 






IMMMIMM II II IMMIIMI IMMIIMI MMIMIMIIIIMMI MMI 




Db 


663 


CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 


722 


Qy 


661 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 


720 






1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


723 


CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 


782 


Qy 


721 


TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 


780 






1 1 1 1 MM 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 II llllll 1 1 1 1 1 Ml MM 1 III II 1 MM 1 II II 




Db 


783 


TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 


842 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 
Qy 

Db 

Qy 
Db 

Qy 



781 



843 



841 



903 



901 



963 



961 



ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 

MINI 1 1 III 1 1 1 II II II I !l III II II Mill III II II III 1 1 1 II II I II III II 

ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 
CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 

Mill 1 1 ! I II 1 1 1 II II II I Mill II II Mill III 1 1 1 1 III II II I II I II III II 

CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 



840 



902 



900 



962 



960 



CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 

MM 1 1 1 MM I Ml I II M I Mill 1 1 1 1 MIMIII M 1 1 II I II II I II I II Ml II 

CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 1022 



AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 102 0 

III 1 1 MM 1 1 1 1 1 1 1 M II III I II MM Mill Ml II Ml 1 1 Ml 1 1 II I II II I II 

1023 AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 1082 
1021 TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

I I M 1 1 1 1 1 1 1 1 1 M 1 1 1 II I II I Ml 1 1 1 1 1 II II 1 1 1 1 1 II I II 1 1 1 II I M M 1 1 1 

1083 TACTATGCTCTTTCCCACCAACAGAAGAGCCGTGCCTTCTACCGTATCCAAGCCACTCGT 1142 

1081 ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 114 0 

I I I I I I 1 I I I I 1 I 1 I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 t I I I I 1 I I 1 I 1 I 
1143 ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 1202 

1141 TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 1200 

I I I III 1 1 1 II I II I III II I II I II MIMI I M 1 1 1 MIMI I Ml II II I II II 1 1 1 

12 03 TCCAGCATGAGCGAGGTGCACACCGATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTT 1262 

12 01 GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 

I I I f I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ] I I I I I I I I I I I 
12 63 GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 132 2 

12 61 AAAGGGGGAGACATGTC AAAGACC ATGTATGTGGACTAC AAAACAGAGGATGGTTCTGCC 132 0 

1 1 1 1 Ml 1 1 II 1 1 1 MM II 1 1 1 1 II MIMI I II I M MIMI 1 1 1 1 II II I II II 1 1 1 

132 3 AAAGGGGGAGACATGTCAAAGACCATGTATGTGGACTACAAAACAGAGGATGGTTCTGCC 1382 
1321 AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 13 80 

1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 II II II M 1 1 M II 1 1 II 1 1 II 1 1 1 1 1 1 M Ml II I II II 

13 83 AATGCAGGGGCTGACTATGAGTTCACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 1442 
13 81 CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGG AGGATGAACACTTC 144 0 

1 1 Ml I I M I 1 1 1 1 II II I II III 1 1 II II III MM I 1 1 II Ml I 1 1 III II II I M 

144 3 CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 1502 
1441 TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 150 0 

1 1 1 1 1 1 Ml 1 1 1 M 1 1 II MM 1 1 II MIMI I Ml 1 1 II MM MM II II I MM II I 

TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 



1503 



1562 



1501 GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 

1563 GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1622 
1561 GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 162 0 

1 1 1 1 Ml 1 1 II 1 1 1 1 1 II Ml II III 1 1 1 1 Mill MM 1 1 1 II I M 1 1 1 II I II Ml 1 1 

162 3 GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 1682 
1621 GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 1680 



IIIMIIIMIIII IIIIIMMMI ! II I IIIIIIIIIIIIIIIIIIMIII! Mill 

3TCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 1742 



Db 


1683 


>J x x \Jn\Jr\\j i. r\ X X VJu x vj x x x VJVJriuVj X U/lnVJvJ 1 1 L 1 ULuUnvn X V_ttVJVJ X VJU^UUVJVju X 


1 7d 9 
x / *± z 


Qy 


1681 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 


1740 






1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 
M M 1 M II II II 1 1 II II II II II II II 1 II II II 1 II II II II 1 II 1 1 II 1 1 II 1 II 1 




Db 


1743 


ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGAPAGPPAAGGGTGGPGGTGAGGAPTTT 




Qy 


1741 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 


1800 






MINIMI III MM IMHM MM IIMM INN 1 1 1 




Db 


1803 


GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTCAAAACAATTCACATC 


1862 


Qy 


1801 


AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 


1860 






II II 1 1 1 1 1 1 1 1 1 1 III II 1 1 1 1 1 1 1 1 II 1 1 1 I I I I I 
II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II 1 1 1 




Db 


1863 


AAGGTAATTGATGATGAGGPATATGAGAAAAAPAAnAZXTTAPTTPZVTTPAnaTPZXTPPPP 

****vj vj x x x VJri x vxri x vjttVJVJ v.tt X tt. X vjttOtttttttt^^ttttvjtttt. X X X X X X VjttVJtt. X \Jf\ X uoUL 


1 Q9 9 
-L _? Z Z 


Qy 


1861 


CCGAAATGGATGGAACGTGGAATATCAGATGTG ACAGACAGGAAGCTG 

II 1 1 1 1 1 1 1 III 1 1 1 1 1 1 II 1 1 1 1 1 1 


1908 


Db 


1923 


II Mill 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCCCGCATGGTGGATATGAGTTTTPAGAAAGPGPTPPTGTTATPTPPAGAPAGGAAGPTG 


X J o z 


Qy 


1909 


ACTATGGAAGAAGAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTGGGTGAA 


1968 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 11 1 1 1 11 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 

1 1 II 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 II II 1 II 1 1 II 1 1 




Db 


1983 


ACTATGGAAGAAGAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGPPAGTATTGGGTGAA 


9049 
z \j *± z 


Qy 


1969 


CACCCCAAACTAGAAGTCATCATTGAAGAGTCCTATGAGTTCAAGACTACGGTGGACAAA 


2028 






I 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II II 1 II II 1 M 1 1 1 II II II 1 1 1 II 1 II 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2043 


CACCCCAAACTAGAAGTPATPATTGAAGAnTPPTATGAGTTPAAGAPTAPGGTPnAPA A A 

n— v» v*.rvrun.v^ x rtVJririvj x v^rt. x v— x X VJtttt.VjttVj lUUl tt. X unU X X V-tttt.VjttV_* X nLuu X OurtV^rtriri 


9 1 09 


Qy 


2029 


CTGATCAAGAAGACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTC 


2088 






1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
M M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2103 


CTGATCAAGAAGACAAACCTGGCCTTGGTTGTGGGGAPPPATTPPTGGAGGfiAPPAGTTP 


9 1 £9 
Z x D Z 


Qy 


2089 


ATGGAGGCCATCACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAG 


2148 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 
II M 1 1 1 II 1 1 II 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 




Db 


2163 


ATGGAGGCCATCACPGTPAGTGPAGPAGGGGATGAGGATGAGGATnA ATPPPHPPAPPAn 

J- vjvjnvju^ ^.ri x v-riu\^vj x \_ttVJ X UUnu^nUUOun X unuun X \Jt\\J\Jn X vJrt/\X V_ V_\jVjOVJttAJVjtt,Vj 


9 9 9 9 


Qy 


2149 


AGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTG 


2208 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l I I 1 I I I I I I I I I II I I I I I I I 
1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2223 


AGGPTGPPPTPPTnPTTTGAPTAPPTPATfJPAPTTPPTPAPT'nTPTTPT'nnA APPTPPTP 


O 9 Q O 
Z Z O Z 


Qy 


2209 


TTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATC 


2268 






1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 M 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I | I I | | | | | | | | | | | | | | | | 




Db 


2283 


TTTGCCTGTGTGCCCCCPAPAGAGTAPTGPPAPGGPTGGGPPTGPTTPGPPnTPTPPATP 


9 ^ A 9 
Z j *± Z 


Qy 


2269 


CTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACC 


2328 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 I I 
1 1 1 1 1 1 1 1 II 1 1 1 II II 1 1 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 




Db 


2343 


PTPATP ATTGGPATGPTPAPPGPP ATP A TTGGGP A PPTPPPPTPPP A PTTPPPPTPP A PP 

v- x v^ni v».tt.X X uO\»rtl X \*tt,VwV_VJV-\- tt. X \_ttX X VJVJVJVJtt.v^ K*. XuuLvv X Luv-nv. X X \„\JVJV^ ±VJV-.r\V_V^ 


O A fi *5 

z *t Uz 


Qy 


2329 


ATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCA 


2388 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 
1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 | | 




Db 


2403 


ATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCA 


2462 


Qy 


2389 


GATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGC 


2448 






1 1 1 III 1 II 1 1 1 II 1 II 1 Mill 1 IMIIIIMII 1 1 III II 1 II II 1 II III 1 Mill I 




Db 


2463 


GATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGC 


2522 


Qy 


2449 


AACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTG 


2508 



I : i 1 1 1 . 1 1 M II II M ! 1 1 1 1 1 1 Ill III II! 



Db 


2523 


Qy 


2509 


Db 


2583 


Qy 


2569 


Db 


2643 


Qy 


2629 


Db 


2703 


Qy 




Db 


2763 


Qy 


2749 


Db 


2823 



252 3 AACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTG 2582 
GCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCC 2568 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 Ml II 1 1 1 1 1 1 1 1 1 1 1 

GCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCC 2642 
TTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGA 2 628 

1 1 1 1 1 1 1 1 1 1 M 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 ! 1 1 1 

TTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGA 2 702 
AGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACA 2 688 

I II I II 1 1 1 1 1 1 1 II I III II 1 1 1 II I III 1 1 II 1 1 II 1 1 II II II II I II I MM II I 

AGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACA 2762 
TGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGC 274 8 

M 1 1 1 1 M I M 1 1 1 M M MM 1 1 IN II 1 1 II I M 1 1 1 1 1 1 1 MM I II M 1 1 II II II 

TGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGC 2 822 



MMMMMMMMM 

TACATCAAGGGGTTCTAA 2 84 0 



Search completed: January 22, 2005, 08:02:04 
Job time : 1307.46 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



updates/sec 
Title: 

Perfect score: 

Sequence: 

2766 



January 22, 2005, 07:18:20 ; Search time 226.467 Seconds 

(without alignments) 
8681.377 Million cell 



US-10-054-680-1 
2766 

1 atggcgtggttaaggttgca gctacatcaaggggttctaa 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



824507 seqs, 355394441 residues 



1649014 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patents__NA: * 

1 : /cgn2_6/ptodata/l/ina/5A_COMB. seq: * 

2 : /cgn2_6/ptodata/l/ina/5B_COMB. seq: * 

3 : /cgn2_6/ptodata/l/ina/6A_COMB. seq: * 

4 : /cgn2_6/ptodata/l/ina/6B_COMB. seq: * 

5 : /cgn2_6/ptodata/l/ina/PCTUS_COMB. seq: * 

6 : / cgn2_6/ptodata/l/ina/backfilesl . seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being 
printed, 

and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB ID 



Description 



c 1 


78.8 


2.8 


7218 


1 


US-08-232-463-14 


Sequence 


14, 


Appl 
















2 


76.6 


2.8 


1935 


4 


US-09-701-068-3 


Sequence 


3, 


Appli 
















3 


76.6 


2.8 


1950 


4 


US-09-701-068-1 


Sequence 


1, 


Appli 
















4 


51.6 


1.9 


7218 


1 


US-08-232-463-14 


Sequence 


14, 


Appl 
















5 


50.8 


1.8 


951 


4 


US-09-961-679-3 


Sequence 


3, 



Appli 


















6 


50 


.8 


1. 


8 


1062 


4 


US-09-961-679-5 


Sequence 1 


Appli 


















7 


50 


.8 


1. 


8 


1812 


4 


US-09-961-679-1 


Sequence ; 


Appli 


















8 


50 


.8 


1. 


8 


2366 


4 


US-09-961-679-7 


Sequence 


Appli 


















c 9 


50 


.2 


1. 


8 


390 


3 


US-09-197-649-7 


Sequence 


Appli 


















10 


46 


.2 


1. 


7 


3984 


4 


US-09-016-434-1199 


Sequence 


1199, Ap 


















11 


46 


.2 


1. 


7 


4473 


4 


US-09-799-451-118 


Sequence 


118, App 


















12 


46 


.2 


1. 


7 


4559 


4 


US-09-919-172-61 


Sequence i 


Appl 


















13 


44 


.8 


1. 


6 


2803 


4 


US-09-701-068-4 


Sequence - 


Appli 


















c 14 


44 


.2 


1. 


6 


4403765 


3 


US-09-103-840A-2 


Sequence 


Appli 


















c 15 


43 


.8 


1. 


6 


1992 


4 


US-09-252-991A-9693 


Sequence 


9693, Ap 


















16 


43 


.8 


1. 


6 


2658 


4 


US-09-252-991A-9558 


Sequence 


9558, Ap 


















17 


43 


.8 


1. 


6 


2799 


4 


US-09-252-991A-9604 


Sequence 


9604, Ap 


















c 18 


43 


.4 


1. 


6 


4411529 


3 


US-09-103-840A-1 


Sequence 


Appli 


















19 


42 


.8 


• 1. 


5 


1174 


4 


US-09-270-767-25428 


Sequence 


25428, A 


















20 


42 


.8 


1. 


5 


1612 


4 


US-09-270-767-10086 


Sequence 


10086, A 


















21 


42 


.2 


1. 


5 


660 


4 


US-09-252-991A-5866 


Sequence 


5866, Ap 


















22 


42 


.2 


1. 


5 


1983 


4 


US-09-252-991A-5825 


Sequence 


5825, Ap 


















c 23 


42 


.2 


1. 


5 


1995 


4 


US-09-252-991A-5783 


Sequence 


5783, Ap 


















24 


41 


.8 


1. 


5 


870 


4 


US-09-252-991A-3650 


Sequence 


3650, Ap 


















25 


41 


.8 


1. 


5 


1098 


4 


US-09-252-991A-3605 


Sequence 


3605, Ap 


















c 26 


41 


.8 


1. 


5 


3402 


4 


US-09-252-991A-1374 


Sequence 


1374, Ap 


















27 


41 


• 8 


1. 


5 


3687 


4 


US-09-252-991A-1193 


Sequence 


1193, Ap 


















28 


41 


.8 


1. 


5 


4266 


4 


US-09-252-991A-1234 


Sequence 


1234, Ap 


















29 


41 


.4 


1. 


5 


4403765 


3 


US-09-103-840A-2 


Sequence 


Appli 


















30 


41 


.4 


1. 


5 


4411529 


3 


US-09-103-840A-1 


Sequence 


Appli 


















31 


40 


.6 


1. 


5 


2036 


4 


US-09-799-451-772 


Sequence 


772, App 


















32 


40 


.2 


1. 


5 


1174 


3 


US-09-034-985-1 


Sequence 


Appli 


















33 




40 


1. 


4 


570 


4 


US-09-252-991A-10419 


Sequence 


10419, A 


















c 34 




40 


1. 


4 


789 


4 


US-09-252-991A-10654 


Sequence 


10654, A 


















c 35 




40 


1. 


4 


813 


4 


US-09-252-991A-10547 


Sequence 



10547, A 



36 


40 


1. 


4 


831 


4 


US- 


-09- 


-252- 


•991A-10512 


Sequence 


10512, A 






















37 


39.6 


1. 


4 


1827 


4 


US- 


•09- 


-270- 


•767-1308 


Sequence 


1308, Ap 






















38 


39. 6 


1. 


4 


1827 


4 


US- 


•09- 


-270- 


-767-16590 


Sequence 


16590, A 






















39 


39.6 


1. 


4 


3431 


4 


US- 


-09- 


-221- 


•017B-993 


Sequence 


993, App 






















40 


39.2 


1. 


4 


333 


4 


us- 


09- 


-252- 


•991A-3700 


Sequence 


3700, Ap 






















c 41 


39 


1. 


4 


642 


4 


us- 


•09- 


-252- 


-991A-4949 


Sequence 


4949, Ap 






















42 


39 


1. 


4 


1617 


4 


us- 


•09- 


-489- 


•039A-2262 


Sequence 


2262, Ap 






















43 


39 


1. 


4 


1926 


4 


us- 


09- 


-249- 


•585A-4 


Sequence 


Appli 






















44 


39 


1. 


4 


1931 


2 


us- 


•09- 


-130- 


•114-2 


Sequence 


Appli 






















45 


39 


1. 


4 


1932 


4 


us- 


09- 


-252- 


-991A-4998 


Sequence 



4998, Ap 



ALIGNMENTS 



RESULT 1 

US-08-232-463-14/C 

; Sequence 14, Application US/08232463 
; Patent No. 5670367 
; GENERAL INFORMATION: 
; APPLICANT: DORNER, F. 
; APPLICANT: SCHEIFLINGER, F. 
APPLICANT: FALKNER, F. G. 

TITLE OF INVENTION: RECOMBINANT FOWLPOX VIRUS 
NUMBER OF SEQUENCES: 52 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Foley & Lardner 

STREET: 1800 Diagonal Road, Suite 500 
CITY: Alexandria 
STATE: VA 
COUNTRY: USA 
ZIP: 22313-0299 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.25 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/08/232,463 

FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US/07/935,313 

FILING DATE: 
; APPLICATION NUMBER: EP 91 114 300.6 

FILING DATE: 26-AUG-1991 
ATTORNEY/AGENT INFORMATION: 
NAME: BENT, Stephen A. 
REGISTRATION NUMBER: 29,768 



REFERENCE/ DOCKET NUMBER: 30472/114 IMMU 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (703)836-9300 
TELEFAX: (703)683-4109 
TELEX: 899149 
; INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 7218 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 
CLONE: pTZgpt-Fls 
US-08-232-463-14 

Query Match 2.8%; Score 78.8; DB 1; Length 7218; 

Best Local Similarity 3.8%; Pred. No. 6.9e-13; 

Matches 14; Conservative 231; Mismatches 123; Indels 0; 
Gaps 0; 

Qy 1678 

GGT ACAGT CAT C GT C CCCTTT AGGACAGT AGAAGGGACAGC CAAGGGT GGCGGT GAGGAC 1737 

I I I I I III I I I I I I ::::::::::: : ::::::::: 



Db 1460 

GT TAAAGAGAT AGAAGAATTT GGTAC RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1401 
Qy 1738 

TTT GAAGACAC AT ATGGGGAGTT GGAATT CAAGAATGAT GAAACT GT GAAAACCATAAGG 

..... . . . ...... .... ..... .. .... . 

..... . . . ...... .... ..... .. .... . 

Db 1400 

RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 
Qy 1798 

GTT AAAAT AGT AGAT GAGGAGGAAT ACGAAAGGCAAGAGAAT TT CTT CATT GCC CTT GGT 1857 



Db 1340 

RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1281 
Qy 1858 

GAACC GAAAT GGAT GGAAC GT GGAAT AT C AGAT GT GACAGACAGGAAGCT GACT ATGGAA 1917 



Db 1280 

RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1221 
Qy 1918 

GAAGAGGAGGCCAAGAGGAT AGCAGAGAT GGGAAAGCCAGT ATT GGGT GAACAC C CCAAA 1977 



Db 1220 

RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1161 
Qy 1978 

CT AGAAGT CAT C AT TGAAGAGT C CT AT GAGT T CAAGACT AC GGT GGACAAACT GAT CAAG 2037 



Db 1160 



RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1101 



Qy 2038 AAGACAAA 2045 

Db 1100 RRRRRRRR 1093 



RESULT 2 
US-09-701-068-3 

; Sequence 3, Application US/09701068 

; Patent No. 6677506 

; GENERAL INFORMATION: 

; APPLICANT: Galil, Gad et al . 

; TITLE OF INVENTION: DNA CODING FOR A Mg2+/H+ OR Zn2+/H+ EXCHANGER AND 
TRANSGENIC PLANTS 

; TITLE OF INVENTION: EXPRESSING SAME 
; FILE REFERENCE: 01/21317 

; CURRENT APPLICATION NUMBER: US/09/701, 068 

; CURRENT FILING DATE: 2001-05-07 

; NUMBER OF SEQ ID NOS : 17 

; SOFTWARE: Patentln version 3.0 

; SEQ ID NO 3 

LENGTH: 1935 

TYPE: DNA 

ORGANISM: Arabidopsis thaliana 
FEATURE: 
NAME/ KEY: CDS 
LOCATION: ( 136) ..( 1755 ) 
US-09-701-068-3 



Query Match 2.8%; Score 76.6; DB 4; Length 1935; 

Best Local Similarity 51.3%; Pred. No. 1.2e-12; 

Matches 178; Conservative 0; Mismatches 169; Indels 0; 
Gaps 0; 

Qy 2174 

TCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGT 2233 

II I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I 

Db 1169 

TCTGGCATTTACTCCTCGCCCCTTGGAAACTGCTTTTTGCATTTGTGCCCCCCTGCAACA 1228 
Qy 2234 

ACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCA 2293 

I I I I I I I I I III I III I I I I I I I I I I I II 

Db 1229 

TTGCTCACGGTTGGATCGCTTTCATCTGCTCTCTCCTCTTCATCAGTGGAGTAGCCTTTG 1288 



Qy 2294 

T CATT GGGGACCT GGCCT CGCACTT C GGCT GCACC AT T GGT CT CAAAGATT CAGT CACAG 2353 

II II I I I I I I I I I I I I I I I I I I 

I I 

Db 1289 

TT GT CACAAGATTT ACT GAC CT T AT AAGCT GT GT CACT GGAATAAACCCAT AT GT GAT AG 1348 



Qy 2354 

CTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAAGCTGCTG 2413 
I I I I I I II I I I II I I I I I I II II I 

I I I I 

Db 1349 

CAT T CAC AGCACT C GCAAGT GGAACT TCAT GGC CAGACT T AGT AGCAAGTAAAAT CGCTG 1408 



Qy 2414 

C CCT CCAGGAT GT AT AT GCAGAC GCCT CCATT GGCAACGT GAC GGGC AGCAACGCCGT CA 2473 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1409 

CAGAGCGACAACTAACC GC AGAT T CAGCT ATT GCAAACAT CACCT GCAGTAACT CGGT GA 1468 

Qy 2474 ATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 2520 

I III I I I I I I I I I I I I I I II I I I I I 

Db 1469 ACAT CTATGT GGGGATTGGAGTT CCGTGGCTGATAAACACAGTCTAC 1515 



RESULT 3 
US-09-701-068-1 

; Sequence 1, Application US/09701068 

; Patent No. 6677506 

; GENERAL INFORMATION: 

; APPLICANT: Galil, Gad et al . 

; TITLE OF INVENTION: DNA CODING FOR A Mg2+/H+ OR Zn2+/H+ EXCHANGER AND 
TRANSGENIC PLANTS 

; TITLE OF INVENTION: EXPRESSING SAME 
; FILE REFERENCE: 01/21317 

; CURRENT APPLICATION NUMBER: US/09/701,068 

; CURRENT FILING DATE: 2001-05-07 

; NUMBER OF SEQ ID NOS : 17 

; SOFTWARE: Patent In version 3.0 

; SEQ ID NO 1 

LENGTH: 1950 
; TYPE: DNA 

; ORGANISM: Arabidopsis thaliana 
US-09-701-068-1 

Query Match 2.8%; Score 76.6; DB 4; Length 1950; 

Best Local Similarity 51.3%; Pred. No. 1.2e-12; 
Matches 178; Conservative 0; Mismatches 169; Indels 0; 
Gaps 0; 

Qy 2174 

TCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGT 2233 

II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1174 

TCTGGCATTTACTCCTCGCCCCTTGGAAACTGCTTTTTGCATTTGTGCCCCCCTGCAACA 1233 
Qy 2234 

ACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCA 2293 

I I I I I I I I I III I III I I I I I I I I I I I II 

Db 1234 

TTGCTCACGGTTGGATCGCTTTCATCTGCTCTCTCCTCTTCATCAGTGGAGTAGCCTTTG 1293 
Qy 2294 

T CAT T GGGGACCT GGCCT CGC ACT T C GGCT GC AC C ATT GGT CT CAAAGATT CAGT CACAG 2353 

II II I I I I I I MINIM I I I I 

I I 

Db 1294 

TT GT CACAAGATT T ACT GACCTT ATAAGCT GT GT CACT GGAATAAACC CAT AT GT GAT AG 1353 
Qy 2354 

CTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAAGCTGCTG 2413 

I I I I I I I I I I I I I I I I I I I I I I I I I I 



1 1 1 1 

Db 1354 

CATTCACAGCACTCGCAAGT GGAACTTCAT GGCCAGACTTAGTAGCAAGTAAAATCGCTG 1413 
Qy 2414 

C C CT CC AGGATGT AT ATGCAGAC GC CT CCATT GGCAAC GT GAC GGGCAGCAACGCCGT C A 2473 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1414 

CAGAGCGACAACTAACC GCAGAT T C AGCT AT T GCAAAC AT CAC CTGCAGTAACT C GGT GA 1473 

Qy 2474 ATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 2520 

I III I I I I I I I I I I I I I I II I I I I I 

Db 1474 ACAT CT AT GT GGGGAT T GGAGTT C CGT GGCTGATAAACACAGT CT AC 1520 



RESULT 4 

US-08-232-463-14 

; Sequence 14, Application US/08232463 

; Patent No. 5670367 

; GENERAL INFORMATION: 

APPLICANT: DORNER, F. 

APPLICANT: SCHEIFLINGER, F. 

APPLICANT: FALKNER, F. G. 

TITLE OF INVENTION: RECOMBINANT FOWLPOX VIRUS 
NUMBER OF SEQUENCES: 52 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Foley & Lardner 

STREET: 1800 Diagonal Road, Suite 500 
; CITY: Alexandria 

STATE: VA 
; COUNTRY: USA 

; ZIP: 22313-0299 

COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/232,463 

FILING DATE: 
; CLASSIFICATION: 435 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/07/935,313 

FILING DATE: 

APPLICATION NUMBER: EP 91 114 300.6 
; FILING DATE: 26-AUG-1991 

ATTORNEY/AGENT INFORMATION: 

NAME: BENT, Stephen AV 

REGISTRATION NUMBER: 29,768 
; REFERENCE/ DOCKET NUMBER: 30472/114 IMMU 

; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (703)836-9300 

TELEFAX: (703)683-4109 

TELEX: 899149 
; INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 7218 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: single 



; TOPOLOGY: linear 

IMMEDIATE SOURCE: 
; CLONE: pTZgpt-Fls 

US-08-232-463-14 

Query Match 1.9%; Score 51.6; DB 1; Length 7218; 

Best Local Similarity 5.9%; Pred. No. 0.00017; 

Matches 24; Conservative 215; Mismatches 169; Indels 0; 
Gaps 0 ; 

Qy 395 

CTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTT 454 



Db 1078 

Y YY YYY YYYYYYY YYYYYYYYYYYYY YYYYYYYYYYYYYYYY YYYYYYYYYYYYYY YYYY 1137 
Qy 455 

TAAT T GAGGT GT GT GGT CAT GGGT T CAT T GCT GGT GAT CT GGGAC CT T CT ACC ATT GT AG 514 

• .. ... ... ....... • ... •»•••• 

• .. ... ... ....... • ... ...... 

Db 1138 

YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY Y Y YYYYYYYYYYYYYYYYYYYYYYYYY 1197 
Qy 515 

GGAGT GC AGCCT TCAAC AT GT T CAT CAT CAT T GGCAT CT GT GT CT ACGT GAT C C C AGACG 574 



Db 1198 

YYYY YYY YYY YYY YYY YYY YYY YYY YYY YYY YYY YYY YYY YYYY YYY YYY YYY YYY YYYY 1257 
Qy ' 575 

GAGAGACT CGCAAGAT CAAGCAT CT AC GAGT CTT CTT CAT CACC GCT GCT T GGAGT AT CT 634 



Db 1258 

YYYY YYY YYY YYY YYYY YYY YYY YYY YYYY YYY YYY YYYY YYYYYYY YYYY YYY YYY YYY 1317 
Qy 635 

TTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTT 694 



Db 1318 

YY YYYYYYY YYY YYY YYYY YYY YYY YYY YYY YYYY YYY YYY YYY YYYY YYY YYY YYYY YY 1377 
Qy 695 

GGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAG 754 
Db 1378 

Y YYY YYY YYY YYY YYY YYY YYY YYY YYY YYY YYY YYY YYY YYY YYY YYYY YYY YYY YGTA 1437 

Qy 755 ATAAACGACT GCT CTT CTACAAATACATGCACAAAAAGT ACCGCACAG 802 

III I III I I I I I II I I I I II I I I I 

Db 1438 C CAAATT CT T CT AT CT CTTT AACT ACTT GCAT AGAT AGGTAATT ACAG 1485 



RESULT 5 
US-09-961-679-3 

; Sequence 3, Application US/09961679 
; Patent No. 6787352 
; GENERAL INFORMATION: 



; APPLICANT: Friddle, Carl Johan 
; APPLICANT: Gerhardt, Brenda 

; TITLE OF INVENTION: No. 6787352el Human Ion-Exchanger Proteins and 

Polynucleotides Encoding the 

; TITLE OF INVENTION: Same 

; FILE REFERENCE: LEX-0239-USA 

; CURRENT APPLICATION NUMBER: US/09/961, 679 

; CURRENT FILING DATE: 2001-09-24 

; PRIOR APPLICATION NUMBER: US 60/235,745 

; PRIOR FILING DATE: 2000-09-27 

; NUMBER OF SEQ ID NOS : 7 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 3 

LENGTH: 951 

TYPE: DNA 
; ORGANISM: homo sapiens 
US-09-961-679-3 



Query Match 1.8%; 
Best Local Similarity 51.9%; 
Matches 140; Conservative 
Gaps 1; 



Score 50.8; DB 4; Length 951; 
Pred. No. 6.7e-05; 
0; Mismatches 127; Indels 3; 



Qy 2247 

GGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCT 2306 

III I I II I I I I I II III I I I I I I I I I I I I I I 

I I 

Db 444 

GGT CACCTT CAT CAC C GC CACGCT GT GGAT C GCT GT GTTCT CCT ACATCATGGT GT GGCT 503 
Qy 2307 

GGCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGT 2366 

II I I I I I I I I I I I I I I I III III I II I 

I 

Db 504 

GGT GACT ATT AT CGGAT AC AC ACTT GGGAT C C CGGAT GT CATCAT GGGCATT ACT TT CCT 563 
Qy 2367 

GGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATGT 2426 

I I I I I I I I II I I I I I I I I I I I I I I II II II 

I 

Db 564 

GGCAGCAGGGACAAGTGTT CCAGACT GCATGGCCAGCCTAATTGTGGCGAGACAAGGCCT 623 
Qy 2427 

ATATGCAGACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGG 248 6 



Db 624 

T GGGGAC AT GGC AGT CTCCAACACC AT AGGAAGCAACGT GTTT GAC AT CCT GGT AGG 680 

Qy 2487 CATCGGCCTGGCCTGGTCCGTGGCCGCCAT 2516 

III I I I I I III I I I I 

Db 681 ACTTGGTGTACCGTGGGGCCTGCAGACCAT 710 



RESULT 6 
US-09-961-679-5 

; Sequence 5, Application US/09961679 
; Patent No. 6787352 



GENERAL INFORMATION: 
APPLICANT: Friddle, Carl Johan 
APPLICANT: Gerhardt, Brenda 

TITLE OF INVENTION: No. 6787352el Human Ion-Exchanger Proteins and 
Polynucleotides Encoding the 
TITLE OF INVENTION: Same 
FILE REFERENCE: LEX-0239-USA 
CURRENT APPLICATION NUMBER: US/09/961 , 679 
CURRENT FILING DATE: 2001-09-24 
PRIOR APPLICATION NUMBER: US 60/235,745 
PRIOR FILING DATE: 2000-09-27 
NUMBER OF SEQ ID NOS : 7 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 5 
LENGTH: 1062 
TYPE: DNA 

ORGANISM: homo sapiens 
US-09-961-679-5 

Query Match 1.8%; Score 50.8; DB 4; Length 1062; 

Best Local Similarity 51.9%; Pred. No. 7.3e-05; 
Matches 14 0; Conservative 0; Mismatches 127; Indels 3; 
Gaps 1; 

Qy 2247 

GGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCT 2306 

III I II I I I I I I II III I I I I I I I I I I I I I I 

I I 

Db 555 

GGTCACCTTCATCACCGCCACGCTGTGGATCGCTGTGTTCTCCTACATCATGGTGTGGCT 614 
Qy 2307 

GGCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGT 2366 

II I I I I I I I I I I I I I II III III I II I I I I 

I 

Db 615 

GGTGACTATTATCGGAT ACACACTT GGGATCCCGGAT GTCAT CAT GGGCATTACTTTCCT 674 
Qy 2367 

GGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATGT 2426 

I I II I I I I I I I I I I I I I I I I I I I I II II II I 

I 

Db 675 

GGCAGCAGGGACAAGT GTT CCAGACT GC AT GGCCAGC CTAATT GT GGC GAGACAAGGC CT 734 
Qy 2427 

ATATGCAGACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGG 2486 

II III I I I I I I I I I I I I I I I I I I III I 

II 

Db 735 — 

TGGGGACAT GGCAGTCT CCAACACCATAGGAAGCAACGT GTTTGACATCCT GGTAGG 791 

Qy 2487 CATCGGCCTGGCCTGGTCCGTGGCCGCCAT 2516 

III I I I I I III I I I I 

Db 792 ACTTGGTGTACCGTGGGGCCTGCAGACCAT 821 



RESULT 7 
US-09-961-679-1 

; Sequence 1, Application US/09961679 



; Patent No. 6787352 
; GENERAL INFORMATION: 
; APPLICANT: Friddle, Carl Johan 
; APPLICANT: Gerhardt, Brenda 

; TITLE OF INVENTION: No. 6787352el Human Ion-Exchanger Proteins and 
Polynucleotides Encoding the 
; TITLE OF INVENTION: Same 
; FILE REFERENCE: LEX-0239-USA 
; CURRENT APPLICATION NUMBER: US/09/961,679 
; CURRENT FILING DATE: 2001-09-24 
; PRIOR APPLICATION NUMBER: US 60/235,745 
; PRIOR FILING DATE: 2000-09-27 
; NUMBER OF SEQ ID NOS : 7 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 1 

LENGTH: 1812 

TYPE: DNA 
; ORGANISM: homo sapiens 
US-09-961-679-1 

Query Match 1.8%; 
Best Local Similarity 51.9%; 
Matches 140; Conservative 
Gaps 1; 

Qy 2247 

GGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCT 2306 

III I I I I I I I I I II III I I I I I I I I I I I I I I 

I I 

Db 1305 

GGT C AC CTT CAT C AC C GC CAC GCT GT GGAT C GCT GT GT TCT CCT ACATCAT GGT GT GGCT 1364 
Qy 2307 

GGCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGT 2366 

II I I I I I I I I I I I I I II III III I II I I I I 

I 

Db 1365 

GGT GACT ATT AT CGGAT ACACACT T GGGAT CC CGGAT GT CAT CATGGGCATT ACTT T CCT 1424 
Qy 2367 

GGCATT T GGCACCT CT GT C CCAGATAC GTTT GCCAGCAAAGCT GCT GCC CT C C AGGAT GT 2426 



Db 

GGCAGCAGGGACAAGTGTTCCAGACTGCATGGCCAGCCTAATTGTGGCGAGACAAGGCCT 14 84 
Qy 2427 

ATATGCAGACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGG 2486 

II Ml I I I II I I I I I I I I I I I I I III I 

I I 

Db 1485 — 

T GGGGAC AT GGCAGT CT C CAACAC C AT AGGAAGCAACGT GT TT GACATCCT GGT AGG 1541 

Qy 2487 CATCGGCCT GGCCT GGTCCGT GGCCGCCAT 2516 

III I I I I I III I I I I 

Db 1542 ACTTGGTGTACCGTGGGGCCTGCAGACCAT 1571 



Score 50.8; DB 4; Length 1812; 
Pred. No. 0.00011; 
0; Mismatches 127; Indels 3; 



RESULT 8 
US-09-961-679-7 



Sequence 7 , Application US/09961679 
Patent No. 6787352 
GENERAL INFORMATION: 
APPLICANT: Friddle, Carl Johan 
APPLICANT: Gerhardt, Brenda 

TITLE OF INVENTION: No. 6787352el Human Ion-Exchanger Proteins and 
Polynucleotides Encoding the 
TITLE OF INVENTION: Same 
FILE REFERENCE: LEX-0239-USA 
CURRENT APPLICATION NUMBER: US/ 09/961, 67 9 
CURRENT FILING DATE: 2001-09-24 
PRIOR APPLICATION NUMBER: US 60/235,745 
PRIOR FILING DATE: 2000-09-27 
NUMBER OF SEQ ID NOS : 7 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 7 
LENGTH: 2366 
TYPE: DNA 

ORGANISM: homo sapiens 
US-09-961-679-7 

Query Match 1.8%; Score 50.8; DB 4; Length 2366; 

Best Local Similarity 51.9%; Pred. No. 0.00013; 
Matches 140; Conservative 0; Mismatches 127; Indels 3; 
Gaps 1; 

Qy 2247 

GGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCT 2306 

III I I I I I I I I I II III II I I I I II I I I I I I 

I I 

Db 1462 

GGTCACCTTCATCACCGCCACGCTGTGGATCGCTGTGTTCTCCTACATCATGGTGTGGCT 1521 
Qy 2307 

GGCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGT 2366 

II I I I I I I I II I I I I II III III I II I I I I 

I 

Db 1522 

GGT GACT ATT ATCGGAT ACACACTT GGGAT CCCGGAT GT CAT CAT GGGC ATT ACTTT C CT 1581 
Qy 2367 

GGCAT TT GGCACCT CT GTC C CAGAT ACGT TT GC CAGCAAAGCT GCT GCCCT CCAGGAT GT 2426 

I I I I I I II I I I I I I I I I II I I I I I II II III 

I 

Db 1582 

GGCAGCAGGGACAAGT GTT CCAGACT GCATGGC CAGC CT AATT GT GGCGAGACAAGGCCT 1641 
Qy 2427 

AT AT GCAGAC GCCT CC ATT GGCAACGT GAC GGGC AGCAAC GC C GT CAAT GT CTT C CT GGG 2486 

II III I I I I I I I I I I I I I I I I I I III I 

I I 

Db 1642 — 

T GGGGACATGGCAGTCTCCAACACCATAGGAAGCAACGT GTTTGACATCCT GGT AGG 1698 

Qy 2487 CATCGGCCTGGCCTGGTCCGTGGCCGCCAT 2516 

I II I I I I I III I I I I 

Db 1699 ACTTGGTGTACCGTGGGGCCTGCAGACCAT 1728 



RESULT 9 



US-09-197-649-7/c 

; Sequence 7, Application US/09197649 

; Patent No. 6194550 

; GENERAL INFORMATION: 

; APPLICANT: Gold, Larry 

; APPLICANT: Tuerk, Craig 

; APPLICANT: Pribnow, David 

; APPLICANT: Smith, Jonathan D. 

; TITLE OF INVENTION: Systematic Polypeptide Evolution by Reverse 
Translation 

; FILE REFERENCE: NEX02/C1-CON 

; CURRENT APPLICATION NUMBER: US/09/197,649 

; CURRENT FILING DATE: 1998-11-23 

; EARLIER APPLICATION NUMBER: 07/829,461 

; EARLIER FILING DATE: 1992-01-31 

; EARLIER APPLICATION NUMBER: 07/739,055 

; EARLIER FILING DATE: 1991-08-01 

; EARLIER APPLICATION NUMBER: 07/561,968 

; EARLIER FILING DATE: 1990-08-02 

; NUMBER OF SEQ ID NOS : 26 

; SOFTWARE: Patentln Ver. 2.0 

; SEQ ID NO 7 

LENGTH: 390 

TYPE: DNA 
; ORGANISM: Artificial Sequence 

FEATURE : 

; OTHER INFORMATION: Description of Artificial Sequence: Sequence 

OTHER INFORMATION: having a 120 repeat of ACG flanked by fixed 
; OTHER INFORMATION: fragments having Ncol restriction sites. 
US-09-197-649-7 

Query Match 1.8%; Score 50.2; DB 3; Length 390; 

Best Local Similarity 46.0%; Pred. No. 5.2e-05; 
Matches 169; Conservative 0; Mismatches 198; Indels 0; 
Gaps 0; 

Qy 2238 

CCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCAT 2297 



Db 375 

CCATGGCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGT 316 
Qy 2298 

T GGGGACCTGGCCT CGCACTTCGGCTGCACCATTGGT CTCAAAGATT CAGT CACAGCTGT 2357 

I I I I I I I I I I I I I I I II I III I 

I I 

Db 315 

CGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGT 256 
Qy 2358 

TGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCT 2417 

I I I I I I I I I I I III I I I I I I I I I 

I 

Db 255 

CGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGT 196 



Qy 2418 

C C AGGAT GT AT AT GC AGACGCCT C CAT T GGCAAC GT GACGGGCAGCAAC GCCGT CAAT GT 2477 

I III I I I I I I I I I II I I I I I I I I I I 



Db 195 

CGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGT 136 
Qy 2478 

CTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACA 2537 

I I I I I I I II I I I I - I I I I I I I I I I I I I 

Db 135 

CGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGT 7 6 
Qy 2538 

GGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGC 2597 

I II III I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 75 

CGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGTCGT 16 

Qy 2598 ATTTGTC 2604 

I I I I 

Db 15 CGTCGTC 9 



RESULT 10 

US-09-016-434-1199 

; Sequence 1199, Application US/09016434 

; Patent No. 6500938 

; GENERAL INFORMATION: 

; APPLICANT: Janice Au- Young 

APPLICANT: Jeffrey J. Seilhamer 

TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF SIGNALING 
TITLE OF INVENTION: PATHWAY GENE EXPRESSION 
NUMBER OF SEQUENCES: 1490 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 
STREET: 3174 PORTER DRIVE 
CITY: PALO ALTO 
; STATE: CALIFORNIA 

COUNTRY: USA 
; ZIP: 94304 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/016,434 

FILING DATE: HEREWITH 

CLASSIFICATION: 
PRIOR APPLICATION DATA': 

APPLICATION NUMBER: 

FILING DATE: 

CLASSIFICATION: 
; ATTORNEY/AGENT INFORMATION: 

NAME: Zeller, Karen J. 

REGISTRATION NUMBER: 37,071 

REFERENCE/ DOCKET NUMBER: PA- 00 02 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (650) 855-0555 

TELEFAX: (650) 845-4166 
; INFORMATION FOR SEQ ID NO: 1199: 



SEQUENCE CHARACTERISTICS: 
LENGTH: 3984 base pairs 
TYPE: nucleic acid 
; STRANDEDNESS: single 

; TOPOLOGY: linear 

; IMMEDIATE SOURCE: 
; LIBRARY: GENBANK 

; CLONE: gl81907 

US-09-016-434-1199 

Query Match 1.7%; 
Best Local Similarity 51.7%; 
Matches 105; Conservative 
Gaps 0; 



Score 46.2; DB 4; Length 3984; 
Pred. No. 0.0052; 
0; Mismatches 98; Indels 0; 



Qy 2434 

GACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGC 2493 
II I I I II I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 674 

GCCTTCTCCAAGGACATCTTCTCCGTCATCAACTTCTTCAGCTTCTTCAACTGGCTCTGC 733 
Qy 2494 

CTGGCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCG 2553 



Db 734 

GTGGCCCTGGCCATCATCGGCATGATCTGGCTGCGCCACAGAAAGCCTGAGCTTGAGCGG 793 
Qy 2554 

GCCGGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGC 2613 
II II II II III I I I I I I I I I I I I I III 

I 

Db 794 

CCCATCAAGGTGAACCTGGCCCTGCCTGTGTTCTTCATCCTGGCCTGCCTCTTCCTGATC 853 

Qy 2614 GTGCTCTTGTACCGAAGGCGGCC 2636 

I I I I I I I I I II 

Db 854 GCCGTCTCCTTCTGGAAGACACC 876 



RESULT 11 
US-09-799-451-118 

Sequence 118, Application US/09799451 
Patent No. 6783969 
GENERAL INFORMATION: 
APPLICANT: Tang, Y. Tom 
APPLICANT: Zhou, Ping 
APPLICANT: Goodrich, Ryle 
APPLICANT: Asundi, Vinod 
APPLICANT: Ren, Feiyan 
APPLICANT: Zhang, Jie 
APPLICANT: Xue, Aidong J. 
APPLICANT: Zhao, Qing A. 
APPLICANT: Wang, Jian-Rui 
APPLICANT: Ma, Yunqing 
APPLICANT: Yamazaki, Victoria 
APPLICANT: Chen, Rui-hong 
APPLICANT: Wang, Zhiwei 
APPLICANT: Wang, Dunrui 



APPLICANT: Yang, Yonghong 
APPLICANT: Wehrman, Tom 
APPLICANT: Ghosh, Reena 
APPLICANT: Drmanac, Radoje T. 

TITLE OF INVENTION: No. 6783969el Nucleic Acids and 
TITLE OF INVENTION: Polypeptides 
FILE REFERENCE: 803 

CURRENT APPLICATION NUMBER: US/09/799,451 
CURRENT FILING DATE: 2001-03-05 
NUMBER OF SEQ ID NOS : 948 
SOFTWARE: pt_FL_genes Version 2.0 
SEQ ID NO 118 
LENGTH: 4473 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 
NAME/ KEY: CDS 
LOCATION: (67) . . (1509) 
US-09-799-451-118 

Query Match 1.7%; Score 46.2; DB 4; Length 4473; 

Best Local Similarity 51.7%; Pred. No. 0.0057; 

Matches 105; Conservative 0; Mismatches 98; Indels 0; 
Gaps 0; 

Qy 2434 

GACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGC 2493 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I 

Db 1150 

GCCTTCTCCAAGGACATCTTCTCCGTCATCAACTTCTTCAGCTTCTTCAACTGGCTCTGC 1209 
Qy 2494 

CTGGCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCG 2553 

I I I I I I I I I I I I I I I I I I I II I I I I I II 

I 

Db 1210 

GTGGCCCTGGCCATCATCGGCATGATCTGGCTGCGCCACAGAAAGCCTGAGCTTGAGCGG 1269 
Qy 2554 

GCCGGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGC 2613 

II II II II III I I I I I I I I I I I I I I I I I I I 

I 

Db 1270 

CCCATCAAGGTGAACCTGGCCCTGCCTGTGTTCTTCATCCTGGCCTGCCTCTTCCTGATC 1329 

Qy 2614 GTGCTCTTGTACCGAAGGCGGCC 2636 

I III I I I I I II 
Db 1330 GCC GT CT CCT T CT GGAAGACAC C 1352 



RESULT 12 
US-09-919-172-61 

; Sequence 61, Application US/09919172 

; Patent No. 6673545 

; GENERAL INFORMATION: 

; APPLICANT: Faris, Mary 

; APPLICANT: Turner, Christopher M. 

; TITLE OF INVENTION: PROSTATE CANCER MARKERS 

; FILE REFERENCE: PA- 0036 US 



; CURRENT APPLICATION NUMBER: US/09/919,172 

; CURRENT FILING DATE: 2001-07-30 

; PRIOR APPLICATION NUMBER: 60/222,469 

; PRIOR FILING DATE: 2000-07-28 

; NUMBER OF SEQ ID NOS : 102 

; SOFTWARE: PERL Program 

; SEQ ID NO 61 

LENGTH: 4559 

TYPE: DNA 
; ORGANISM: Homo sapiens 

FEATURE: 

NAME /KEY: misc_f eature 

OTHER INFORMATION: Incyte ID No. 6673545 008942.10 
US-09-919-172-61 



Query Match 1.7%; 
Best Local Similarity 51.7%; 
Matches 105; Conservative 
Gaps 0; 



Score 46.2; DB 4; Length 4559; 
Pred. No. 0.0058; 
0; Mismatches 98; Indels 0; 



Qy 2434 

GACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGC 2493 
II I I I I I I I I I I I I I I I I I I I I I I INN 

I I 

Db 1233 

GCCTTCTCCAAGGACATCTTCTCCGTCATCAACTTCTTCAGCTTCTTCAACTGGCTCTGC 1292 
Qy 2494 

CTGGCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCG 2553 

I I I I I I I I I I I I I I I I I I I II I I I I I 

I 

Db 1293 

GTGGCCCTGGCCATCATCGGCATGATCTGGCTGCGCCACAGAAAGCCTGAGCTTGAGCGG 1352 
Qy 2554 

GCCGGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGC 2613 

II II II II III I I I I I I I I I I I I I I I I 

I 

Db 1353 

CCCATCAAGGTGAACCTGGCCCTGCCTGTGTTCTTCATCCTGGCCTGCCTCTTCCTGATC 1412 

Qy 2614 GTGCTCTTGTACCGAAGGCGGCC 2636 

I III I I I I I II 
Db 1413 GC C GT CTCCTT CT GGAAGACACC 1435 



RESULT 13 
US-09-701-068-4 

; Sequence 4, Application US/09701068 

; Patent No. 6677506 

; GENERAL INFORMATION: 

; APPLICANT: Galil, Gad et al. 

; TITLE OF INVENTION: DNA CODING FOR A Mg2+/H+ OR Zn2+/H+ EXCHANGER AND 
TRANSGENIC PLANTS 

; TITLE OF INVENTION: EXPRESSING SAME 
; FILE REFERENCE: 01/21317 

; CURRENT APPLICATION NUMBER: US/ 09/701, 068 

; CURRENT FILING DATE: 2001-05-07 

; NUMBER OF SEQ ID NOS: 17 

; SOFTWARE: Patent In version 3.0 



; SEQ ID NO 4 

LENGTH: 2803 

TYPE: DNA 
; ORGANISM: Arabidopsis thaliana 
US-09-701-068-4 

Query Match 1.6%; Score 44.8; DB 4; Length 2803; 

Best Local Similarity 52.1%; Pred. No. 0.011; 

Matches 100; Conservative 0; Mismatches 92; Indels 0; 
Gaps 0; 

Qy 477 

GTT CAT T GCT GGT GAT CT GGGAC CT T CT ACCATT GT AGGGAGT GC AGC CTT CAACAT GT T 536 

III I I I I I I I I I I I II I I I I I I I I I I I I I I 

I I 

Db 950 

GTTTTGTCATATAGGTCTTGGTCCTGGAACACTTGTTGGCTCAGCTGCATTTGATCTTTT 1009 
Qy 537 

CAT CATCATTGGCAT CTGTGT CTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCA 596 

I I I I I I I I I I I I I III II I I I I I I I I I I 

I 

Db 1010 

CC C CAT C CAC GCT GT TT GT GT CGT T GTGCCAAAAGCT GGAGAACT GAAAAAGAT AT C CGA 1069 
Qy 597 

T CT AC GAGT CT T CT T CAT CAC CGCT GCTT GGAGT AT CT TT GC CT ACAT CT GGCT CT AT AT 656 

I I I I I I I I I III III I I I I I I I I I I I I I I I 

I I 

Db 1070 

CTTAGGTGTTTGGCTAGTTGAGCTCGTATGGTCTTTTTGGGCTTACATCTGGCTATACAT 112 9 

Qy 657 GATTCTGGCAGT 668 

I I I I I II 
Db 1130 AAT C CT C GAG GT 1141 



RESULT 14 

US-09-103-840A-2/C 

; Sequence 2, Application US/09103840A 

; Patent No. 6294328 

; GENERAL INFORMATION: 

; APPLICANT: FLEISCHMAN, Robert D. 

; APPLICANT: WHITE, Owen R. 

; APPLICANT: FRASER, Claire M. 

; APPLICANT: VENTER, John C. 

; TITLE OF INVENTION: DNA SEQUENCES FOR STRAIN ANALYSIS IN 
MYCOBACTERIUM 

; TITLE OF INVENTION: TUBERCULOSIS 

; FILE REFERENCE: 24366-20007.00 

; CURRENT APPLICATION NUMBER: US/09/ 103 , 84 OA 

; CURRENT FILING DATE: 1998-06-24 

; NUMBER OF SEQ ID NOS : 2 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 2 

LENGTH: 4403765 

TYPE: DNA 

; ORGANISM: Mycobacterium tuberculosis 
FEATURE : 

; OTHER INFORMATION: CDC 1551 



OTHER INFORMATION: "n" bases at various positions throughout the 
sequence 

OTHER INFORMATION: represent a, t, c or g 
US-09-103-840A-2 

Query Match 1.6%; Score 44.2; DB 3; Length 4403765; 

Best Local Similarity 46.3%; Pred. No. 4.2; 

Matches 145; Conservative 0; Mismatches 168; Indels 0; 
Gaps 0; 

Qy 2213 

CCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCA 2272 

III I I I I I I III I I I I I I I I I I I I I I I I II 

I 

Db 3929402 

CCTCCCTTACCGCCGTTGCCGCCGGCGCCGGCGGCGCCGGCTACGCCGCTGCCGAATCCC 3929343 
Qy 2273 

TCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTG 2332 

I II I I I I I I I I I II II I I I I I I I I I I I 

I 

Db 3929342 

GCGCCGCCGCCTTCGCCGCCGTCCCCACCGTCACCGCCTTGGCCGCCGGCGCCGCCCTCG 3929283 
Qy 2333 

GT CT CAAAGATT C AGT C AC AGCT GT T GTTT T CGT GGC ATTT GGC AC CT CTGT C C CAGAT A 2392 

I I I I I I I I I I I I I I I I I I I I I 

I 

Db 3929282 

CTCGCCACGCCTGTCGTTCCGTTCTGGCCGTCACCACCGGCCCCGCCGGTGCCGCCGGTG 3929223 
Qy 2393 

CGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACG 2452 

I II I I I I I I I I I I II I I I I I I I I I I I 

I 

Db 3929222 

CCGCCGGCCCCGTTGATGCCGCCGGCGCCGGCGTTGCCGCCGGCCCCGCCTTGGCCGCCT 3929163 
Qy 2453 

TGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCG 2512 

II I I I I I I I I I I I I I I I I I II I II I 

I I I I 

Db 3929162 

TGGCCGCCGGCGAAGCCGTTGCCGTCTTGGGAGAGGGCGCCGCTGTCGCCGGCCCCGCCG 3929103 

Qy 2513 CCATCTACTGGGC 2525 

I I I III 
Db 3929102 TCGCCGCCGCGGC 3929090 



RESULT 15 

US-09-252-991A-9693/C 

; Sequence 9693, Application US/09252991A 
; Patent No. 6551795 
; GENERAL INFORMATION: 

; APPLICANT: Marc J. Rubenfield et al . 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING 
TO PSEUDOMONAS 

; TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: 107196.136 



; CURRENT APPLICATION NUMBER: US/09/252, 991A 

; CURRENT FILING DATE: 1999-02-18 

; PRIOR APPLICATION NUMBER: US 60/074,788 

; PRIOR FILING DATE: 1998-02-18 

; PRIOR APPLICATION NUMBER: US 60/094,190 

; PRIOR FILING DATE: 1998-07-27 

; NUMBER OF SEQ ID NOS : 33142 

; SEQ ID NO 9693 

LENGTH: 1992 

TYPE: DNA 
; ORGANISM: Pseudomonas aeruginosa 
US-09-252-991A-9693 

Query Match 1.6%; Score 43.8; DB 4; Length 1992; 

Best Local Similarity 47.5%; Pred. No. 0.017; 

Matches 162; Conservative 0; Mismatches 177; Indels 2; 
Gaps 1; 

Qy 2372 

TT GGCAC CT CTGT CCC AGAT AC GT T TGCC AGCAAAGCTGCT GC CCT C CAGGAT GT ATAT G 2431 

I I I I I II II I I I I I I III I I I I I I I I I I I I I III Ml 

I 

Db 738 TTGGCACAGCTCGGCCTGTTCCGCCTGCTCGCCACGCTGCTGCC — 

GCCGGACGTACCAG 681 

Qy 2432 

CAGACGCCT CCAT TGGCAACGT GAC GGGC AGCAAC GCC GT CAAT GT CTT C CT GGGC AT C G 2491 

I I I I I I I II III I I I I I I III I I I I I 

I 

Db 680 

CCGGCGGCCTGTTCCCCGCCTTCGCCGGGGTCGCCACCGGCCTGGTATCGCTGGCCGGCT 621 
Qy 2492 

GCCTGGCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGT 2551 

I I II I I I I II I I I I I I I I I I III III 

Db 620 

TCGCCCTCCCGCCGCTGGCAGCCCTGGGCCGGGTACCGCCGTTGCGGGTCCTGCGCAGCG 561 
Qy 2552 

CGGCCGGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCA 2611 

I I I I I II I I I I I I I I I I I I 

Db 560 

ACCTGCTGCCGGTGCCGATGCGCACCTGGATGGCCTACGCCTGCGCCCTGCTCGCCCTGG 501 
Qy 2612 

GCGTGCTCTTGTACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCT 2671 



Db 500 

GCCTGATCATGTGGCGGTTGAGTCTCGACCTGAAGCTGACCCTCGCCCTGCTCGGCGGCG 441 

Qy 2672 GCAAGCTCGCCACAACATGGCTCTTTGTGAGCCTGTGGCTC 2712 

II I I I I I I I I II II I I I I I I I I I 

Db 440 GCCTGGTCGCCACGCTGGTGCTCGGCGCCCTCCTGCTGCTC 400 



Search completed: January 22, 2005, 17:35:36 
Job time : 241.467 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



updates/sec 



January 22, 2005, 07:25:49 ; Search time 1463.37 Seconds 

(without alignments) 
10860.644 Million cell 



Title: 

Perfect score: 

Sequence: 

2766 



US-10-054-680-1 
2766 

1 atggcgtggttaaggttgca. 



. gctacatcaaggggttctaa 



Scoring table: IDENTITY__NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 4300275 seqs, 2872944193 residues 

Total number of hits satisfying chosen parameters: 8600550 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_NA: * 

1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:* 

2 : Vcgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq: * 

3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:* 

4 : / cgn2_6 /ptodata/2 /pubpna/US 0 6_PUBCOMB . seq : * 

5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq: * 

6 : / cgn2_6 /pt oda t a / 2 /pubpna / PCTUS_PUBCOMB .seq:* 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB. seq: * 

8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:* 

9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:* 
10 : /cgn2_6/ptodata/2/pubpna/US09B__PUBCOMB. seq: * 
11 : /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB. seq: * 
12 : /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB. seq: * 
13: /cgn2_6/ptodata/2/pubpna/US10AJPUBCOMB.seq:* 
14 : /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:* 
15: /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB. seq:* 
16: /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:* 
17 : /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB. seq: * 
18: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:* 
19: /cgn2_6/ptodata/2/pubpna/USll_NEW_PUB.seq:* 
20 : /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB. seq: * 
21 : /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB . seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being 
printed, 

and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 



No. Score 


Match 


Length 


DB 


ID 


Description 




1 2766 


100.0 


2766 


13 


US-10-054-680-1 


Sequence 1, 


Appli 












2 2766 


100.0 


3812 


13 


US-10-054-680-5 


Sequence 5, 


Appli 












3 2761.2 


99.8 


2782 


9 


US-09-804-474A-1 


Sequence 1, 


Appli 












4 2733.4 


98.8 


2781 


14 


US-10-275-116-1 


Sequence 1, 


Appli 












5 2673.2 


96.6 


2685 


15 


US-10-114-153-5 


Sequence 5, 


Appli 












6 2657.6 


96.1 


2840 


15 


US-10-114-153-3 


Sequence 3, 


Appli 












7 2367.2 


85.6 


2813 


15 


US-10-114-153-1 


Sequence 1, 


Appli 












8 1786.4 


64.6 


2534 


15 


US-10-256-537-1 


Sequence 1, 


Appli 












9 1786.4 


64.6 


2534 


15 


US-10-256-537-3 


Sequence 3, 


Appli 












10 1784.8 


64.5 


126512 


9 


US-09-804-474A-3 


Sequence 3, 


Appli 












11 1784.6 


64.5 


1863 


13 


US-10-054-680-3 


Sequence 3, 


Appli 












12 1277 


46.2 


4282 


14 


US-10-281-866-1 


Sequence 1, 


Appli 












13 1277 


46.2 


4282 


14 


US-10-281-866-3 


Sequence 3, 


Appli 












14 1270.8 


45.9 


3004 


16 


US-10-388-934-506 


Sequence 


506, App 












15 1227.8 


44.4 


4087 


9 


US-09-901-419-1 


Sequence 1, 


Appli 












16 1208.8 


43.7 


5438 


18 


US-10-357-930-24294 


Sequence 


24294, A 












17 1207.2 


43.6 


6106 


16 


US-10-062-674-1648 


Sequence 


1648, Ap 












18 897.4 


32.4 


1187 


15 


US-10-243-552-809 


Sequence 


809, App 












c 19 878.4 


31.8 


2859 


18 


US-10-723-860-7370 


Sequence 


7370, Ap 












20 821.4 


29.7 


823 


15 


US-10-029-386-20265 


Sequence 


20265, A 












21 787.2 


28.5 


1836 


9 


US-09-864-761-16939 


Sequence 


16939, A 












22 503.4 


18.2 


505 


15 


US-10-029-386-6536 


Sequence 


6536, Ap 












23 494 


17.9 


551 


15 


US-10-029-386-4103 


Sequence 


4103, Ap 












24 366 


13.2 


366 


15 


US-10-029-386-17804 


Sequence 


17804, A 












25 336.4 


12.2 


507 


15 


US-10-029-386-4003 


Sequence 


4003, Ap 












c 26 219.4 


7.9 


381 


9 


US-09-864-761-1172 


Sequence 


1172, Ap 












27 210.2 


7.6 


502 


15 


US-10-029-386-7461 


Sequence 


7461, Ap 













28 


208.4 


7 


.5 


280 


15 


US-10-029-386-21161 


Sequence 


21161, A 
















29 


186.4 


6 


.7 


491 


9 


US-09-864-761-646 


Sequence 


646, App 
















30 


186.2 


6 


.7 


276 


9 


US-09-864-761-17437 


Sequence 


17437, A 
















31 


180 


6 


.5 


180 


15 


US-10-029-386-17706 


Sequence 


17706, A 
















32 


169 


6 


.1 


477 


10 


US-09-918-995-2005 


Sequence 


2005, Ap 
















33 


164.2 


5 


.9 


459 


9 


US-09-864-761-102 


Sequence 


102, App 
















34 


146.6 


5 


.3 


1132 


15 


US-10-369-493-30006 


Sequence 


30006, A 
















35 


108 


3 


.9 


1302 


15 


US-10-243-552-276 


Sequence 


276, App 
















c 36 


95 


3 


.4 


151 


9 


US-09-864-761-17938 


Sequence 


17938, A 
















37 


91.4 


3 


.3 


1792 


15 


US-10-369-493-29835 


Sequence 


29835, A 
















38 


80.8 


2 


.9 


911 


18 


US-10-425-115-88676 


Sequence 


88676, A 
















39 


77.4 


2 


.8 


1248 


18 


US-10-425-115-125495 


Sequence 


125495, 
















40 


76.6 


2 


.8 


1617 


9 


US-09-938-842A-2591 


Sequence 


2591, Ap 
















41 


76.6 


2 


.8 


1617 


11 


US-09-938-842A-2591 


Sequence 


2591, Ap 
















42 


74.2 


2 


.7 


611 


17 


US-10-767-701-15967 


Sequence 


15967, A 
















43 


73.4 


2 


.7 


968 


16 


US-10-424-599-94222 


Sequence 


94222, A 
















44 


60 


2 


.2 


128 


9 


US-09-864-761-20736 


Sequence 


20736, A 
















c 45 


60 


2 


.2 


136 


15 


US-10-029-386-23072 


Sequence 



23072, A 



ALIGNMENTS 



RESULT 1 
US-10-054-680-1 

; Sequence 1, Application US/10054680 
; Publication No. US20020132998A1 
; GENERAL INFORMATION: 
; APPLICANT: Friddle, Carl Johan 
; APPLICANT: Hilbun, Erin 

; TITLE OF INVENTION: No. US20020132998Alel Human Ion Exchanger 

Proteins and Polynucleotides Encoding the 

; TITLE OF INVENTION: Same 

; FILE REFERENCE: LEX-0301-USA 

; CURRENT APPLICATION NUMBER: US/10/054,680 

; CURRENT FILING DATE: 2002-01-22 

; PRIOR APPLICATION NUMBER: US 60/263,384 

; PRIOR FILING DATE: 2001-01-23 

; NUMBER OF SEQ ID NOS : 5 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 1 

; LENGTH: 2766 



; TYPE: DNA 

; ORGANISM: homo sapiens 
US-10-054-680-1 

Query Match 100.0%; Score 2766; DB 13; Length 2766; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2766; Conservative 0; Mismatches 0; Indels 0; 
Gaps 0; 

Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 
Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 
Qy 121 

ACAGGGCAGAACAATGAGT CCT GTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCT G 180 

I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I II I I 
Db 121 

ACAGGGCAGAACAATGAGT CCT GTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCT G 180 
Qy 181 

CCAAT CT GGT AC CC GGAGAAC CCTTCCCTT GGGGACAAGAT T GCCAGGGT CATT GT CT AT 240 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 
Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCAT CATT GCTGACCGCTT CAT GGCA 300 
Qy 301 

T CT AT T GAAGTCAT CAC CT CT CAAGAGAGGGAGGT GACAAT TAAGAAACC CAATGGAGAA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 301 

T CTATT GAAGTCATCACCTCTCAAGAGAGGGAGGT GACAAT TAAGAAACC CAATGGAGAA 360 
Qy 361 

AC C AGCACAACCACT ATT C GGGT CTGGAAT GAAACTGT CT CC AACCT GACCCTT AT GGCC 420 

I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 361 

ACCAGCACAACCACT AT TCGGGT CT GGAAT GAAACT GT CT CCAAC CT GACCCTT AT GGCC 420 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 



1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

Db 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 
Qy 481 

ATTGCTGGT GAT CT GGGACCTT CT ACCATT GT AGGGAGT GCAGCCTT CAACAT GT T CAT C 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 

ATT GCTGGTGAT CT GGGACCTT CT ACCATT GT AGGGAGT GCAGC CTT CAACAT GT T CATC 540 
Qy 541 

AT CAT TGGC AT CT GT GT CT ACGTGATCCCAGAC GGAGAGACT CGCAAGAT CAAGCAT CTA 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 541 

AT CATT GGCAT CT GT GT CT ACGT GAT C C CAGAC GGAGAGACT CGCAAGAT CAAGCAT CTA 600 
Qy 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 

CGAGT CTT CTT CAT CAC C GCT GCT T GGAGT AT CT T T GC CT ACAT CT GGCT CT AT AT GATT 660 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I I I I I I I I I I I II II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 
Qy 781 

AT GC ACAAAAAGT ACCGCACAGACAAACAC CGAGGAATTAT C AT AGAGACAGAGGGT GAC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 

AT GCACAAAAAGT AC CGCAC AGACAAAC ACC GAGGAATT AT CAT AGAGACAGAGGGT GAC 840 
Qy 841 

CAC C CT AAGGGC ATT GAGAT GGAT GGGAAAAT GAT GAATT C CC ATT TT CT AGAT GGGAAC 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 

CAC CCTAAGGGCATT GAGAT GGAT GGGAAAAT GAT GAATT CCCAT TTT CT AGAT GGGAAC 900 
Qy 901 

CT GGTGCCCCTGGAAGGGAAGGAAGT GGATGAGT CCCGCAGAGAGAT GAT CCGGATT CTC 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 

CT GGT GCC CCT GGAAGGGAAGGAAGTGGAT GAGT CC C GCAGAGAGAT GAT CCGGATT CT C 960 



Qy 961 

AAGGATCT GAAGCAAAAACAC CC AGAGAAGGACTT AGAT CAGCT GGT GGAGAT GGC CAAT 1020 

I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I! I I I I 

Db 961 

AAGGATCT GAAGCAAAAACAC CCAGAGAAGGACT T AGAT CAGCT GGT GGAGAT GGC CAAT 1020 
Qy 1021 

T ACT ATGCT CTT T C CCAC CAACAGAAGAGC CGCGCCTT CTAC CGT AT CCAAGCC ACT CGT 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 

TACT ATGCT CTTT C C C AC CAACAGAAGAGC C GC GCCTT CT ACC GT AT C CAAGCC ACT C GT 1080 
Qy 1081 

AT GAT GACT GGTGCAGGCAATATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I II I I I I I I 
Db 1081 

AT GAT GACT GGT GCAGGCAAT AT C CTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 1140 
Qy 1141 

T CCAGC AT GAGCGAGGT GC ACAC CGAT GAGCCT GAGGACT T T ATTT C CAAGGT CTT CT TT 1200 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I II I I I I I I 
Db 1141 

T C CAGCAT GAGC GAGGT GCACAC CGAT GAGCCT GAGGACTTT ATTT C CAAGGT CTT CTTT 1200 
Qy 1201 

GACC CAT GTT CTT ACCAGT GC CT GGAGAACT GT GGGGCT GT ACT C CT GACAGT GGT GAGG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1201 

GACC CAT GTT CTT ACCAGT GCCT GGAGAACT GT GGGGCT GT ACT CCT GACAGT GGT GAGG 1260 

Qy 1261 

AAAGGGGGAGACAT GTCAAAGAC CAT GT AT GT GGACT ACAAAACAGAGGAT GGT T CT GC C 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1261 

AAAGGGGGAGACAT GTCAAAGAC CAT GT AT GT GGACT ACAAAACAGAGGAT GGTT CT GCC 1320 
Qy 1321 

AATGCAGGGGCTGACTAT GAGTTCACAGAGGGCACGGTGGTTCT GAAGCCAGGAGAGACC 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 

AAT GCAGGGGCTGACTAT GAGT T CAC AGAGGGCACGGT GGT T CT GAAGCCAGGAGAGAC C 1380 
Qy 1381 

C AGAAGGAGT T CT CCGT GGGCATAATT GAT GAC GACATTT TT GAGGAGGAT GAACACT T C 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 1381 

CAGAAGGAGTT CT C C GT GGGCATAATT GAT GAC GACATTTTT GAGGAGGAT GAAC ACTT C 1440 
Qy 1441 

T T T GTAAGGT T GAGCAAT GT C C GC AT AGAGGAGGAGCAGC CAGAGGAGGGGAT GCCT C C A 1500 



1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1441 

TTT GTAAGGT T GAGCAAT GT CC GCATAGAGGAGGAGCAGCCAGAGGAGGGGAT GC CT CCA 1500 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 
Qy 1561 

GT TAC CAT CTT GGAT GAT GAC CAT GCAGGCAT CTT CACTT TT GAAT GT GAT ACT ATT CAT 1620 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 1561 

GT TAC CAT CT TGGAT GAT GAC CAT GCAGGCAT CTT CACTTTT GAAT GT GAT ACTATT CAT 1620 
Qy 1621 

GT CAGTGAGAGT AT T GGT GT T AT GGAGGT CAAGGT T CT GCGGACAT C AGGT GC C CGGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 

GT CAGTGAGAGT ATT GGT GTT AT GGAGGTCAAGGTT CT GC GGACAT CAGGT GC CCGGGGT 1680 
Qy 1681 

AC AGT CAT CGT C C CCTTT AGGACAGTAGAAGGGACAGCCAAGGGT GGC GGT GAGGACTTT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 

AC AGT CAT C GT C CC CTTT AGGAC AGT AGAAGGGAC AGC CAAGGGT GGC GGT GAGGACT TT 1740 
Qy 1741 

GAAGACACATATGGGGAGTTGGAATTCAAGAAT GATGAAACTGT GAAAACCATAAGGGTT 1800 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1741 

GAAGACACATAT GGGGAGTT GGAATTCAAGAATGAT GAAACT GTGAAAACCATAAGGGTT 1800 
Qy 1801 

AAAAT AGTAGAT GAGGAGGAAT AC GAAAGGCAAGAGAATTT CTT C ATT GC C CT T GGT GAA 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1801 

AAAAT AGT AGAT GAGGAGGAAT AC GAAAGGCAAGAGAATTT CTT CAT T GCCCTT GGT GAA 18 60 
Qy 1861 

C C GAAAT GGAT GGAACGT GGAAT AT C AGAT GT GACAGACAGGAAGCT GAC TAT GGAAGAA 1920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1861 

CCGAAATGGATGGAACGT GGAATAT CAGATGT GACAGACAGGAAGCT GACT AT GGAAGAA 1920 
Qy 1921 

GAGGAGGC CAAGAGGAT AGCAGAGAT GGGAAAGCCAGT AT T GGGT GAACACCCCAAACT A 1980 



Db 1921 

GAGGAGGCCAAGAGGAT AGCAGAGAT GGGAAAGCCAGT ATT GGGT GAACAC CCCAAACT A 1980 



Qy 1981 

GAAGT CAT C ATT GAAGAGT CCT AT GAGTT CAAGACT ACGGT GGACAAACT GAT CAAGAAG 2040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I II I I I I I I I I I 
Db 1981 

GAAGTCATC ATT GAAGAGT C CT AT GAGTT CAAGACT ACGGT GGACAAACT GAT CAAGAAG 2040 
Qy 2041 

ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 2100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 2041 

ACAAACCTGGCCTTGGTT GT GGGGACCCATTCCT GGAGGGACCAGTT CAT GGAGGCCATC 2100 
Qy 2101 

ACC GTCAGT GCAGCAGGGGAT GAGGAT GAGGAT GAAT CCGGGGAGGAGAGGCT GCCCT C C 2160 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2101 

ACC GT CAGT GCAGCAGGGGAT GAGGAT GAGGAT GAAT CC GGGGAGGAGAGGCT GCCCT C C 2160 
Qy 2161 

TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 2220 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2161 

TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 2220 
Qy 2221 

CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 2280 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 2221 

CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 2280 
Qy 2281 

ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 2340 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2281 

ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 2340 
Qy 2341 

GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 2400 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2341 

GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 24 00 
Qy 2401 

AGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCCATTGGCAACGTGACGGGC 2460 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2401 

AGCAAAGCT GCT GCCCTCC AGGAT GT AT AT GCAGACGC CT CCATT GGCAACGT GAC GGGC 2460 
Qy 2461 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 2520 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 2461 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 2520 
Qy 2521 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2580 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2521 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2580 
Qy 2581 

CT CT T CACCAT CTT T GCAT TT GT CT GC AT CAGC GT GCT CTT GT AC CGAAGGCGGCCGCAC 2640 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2581 

CT CT T CAC CAT CT T T GCAT TT GT CT GCAT CAGC GT GCT CTT GT ACCGAAGGCGGCC GC AC 2640 
Qy 2641 

CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 2700 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2641 

CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 2700 
Qy 2701 

AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 2760 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2701 

AGCCT GT GGCT CCT CTACATACT CTT T GCCACACT AGAGGC CT AT T GCTACAT CAAGGGG 2760 

Qy 2761 TTCTAA 2766 

I II I I I 

Db 2761 TTCTAA 2766 
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Query Match 



100.0%; Score 2766; DB 13; Length 3812; 



Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2766; Conservative 0; Mismatches 0; Indels 0 
Gaps 0; 

Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 618 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 677 

Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 678 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 737 

Qy 121 

AC AGGGC AGAACAAT GAGT CCT GT T CAGGGT CAT CGGACT GCAAGGAGGGTGT CAT CCT G 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I N I I I I I I I II I I 

Db 738 

ACAGGGCAGAACAAT GAGT C CT GTT CAGGGT CAT CGGACT GCAAGGAGGGTGT CAT CCT G 797 
Qy 181 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 798 

CCAAT CT GGT AC CCGGAGAACC CT T CC CT T GGGGACAAGATTGC CAGGGT CATT GT CT AT 857 
Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 858 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 917 
Qy 301 

T CT ATT GAAGT CATCACCT CT CAAGAGAGGGAGGT GACAAT TAAGAAACC CAAT GGAGAA 360 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I 
Db 918 

TCTATTGAAGTCATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 977 
Qy 361 

ACCAGCACAACCACT ATT C GGGT CT GGAAT GAAACT GT CT CCAACCT GACCCTT AT GGCC 420 

I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 978 

AC CAGCACAACCACT ATT C GGGT CT GGAAT GAAACT GT CT C CAACCT GAC CCT T AT GGCC 1037 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1038 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 1097 



Qy 481 

AT TGCTGGT GAT CT GGGACCT T CT ACCATT GTAGGGAGT GCAGCCTT CAAC AT GTTCATC 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 1098 

ATTGCTGGT GAT CTGGGAC CT T CT ACCATT GTAGGGAGTGCAGCCTT CAAC AT GTT CATC 1157 
Qy 541 

AT CAT TGGCAT CT GT GT CT AC GT GATC CCAGAC GGAGAGACT CGCAAGAT CAAGCAT CTA 600 

I I I I I I I I I I II I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1158 

AT CATT GGCAT CTGT GT CT ACGT GAT CC CAGACGGAGAGACT C GCAAGAT CAAGCAT CTA 1217 
Qy 601 

C GAGT CTT CTT CAT CACCGCT GCT T GGAGT AT CTTT GCCT ACAT CT GGCT CT AT AT GATT 660 

I I I I I I I I I I I I I I I I I I II I I I I I I I I II II I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1218 

C GAGT CT T CT T CAT CAC C GCT GCTT GGAGT AT CT T TGCCT AC AT CT GGCT CT AT AT GATT 1277 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1278 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 1337 
Qy 721 

TTt CCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 1338 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 1397 
Qy 781 

AT GC ACAAAAAGT AC CGC ACAGACAAACAC CGAGGAATTAT CAT AGAGACAGAGGGT GAC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1398 

AT GCACAAAAAGT ACCGCAC AGACAAAC ACC GAGGAAT TAT CAT AGAGACAGAGGGT GAC 1457 
Qy 841 

CACC CTAAGGGCATT GAGAT GGAT GGGAAAAT GAT GAATT CCCATTTT CT AGAT GGGAAC 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 1458 

CACC CTAAGGGCATT GAGAT GGAT GGGAAAAT GAT GAATT C CCAT TTT CT AGAT GGGAAC 1517 
Qy 901 

CT GGT GCC CCT GGAAGGGAAGGAAGT GGAT GAGT C C C GCAGAGAGAT GAT C CGGAT T CT C 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1518 

CTGGTGCCCCT GGAAGGGAAGGAAGT GGATGAGTCCCGCAGAGAGAT GATCCGGATT CT C 1577 
Qy 961 

AAGGAT CT GAAGCAAAAAC AC CCAGAGAAGGACTT AGAT CAGCT GGT GGAGAT GGC CAAT 1020 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 1578 

AAGGAT CT GAAGCAAAAACAC CCAGAGAAGGACTT AGATCAGCT GGTGGAGAT GGC CAAT 1637 
Qy 1021 

T ACT ATGCTCTTT C CCAC CAACAGAAGAGC CGCGC CTT CT ACC GT AT C CAAGC C ACT CGT 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1638 

T ACT ATGCT CTTT CC CAC CAACAGAAGAGC CGC GCCT T CT ACC GT AT C CAAGC CACT CGT 1697 
Qy 1081 

AT GAT GACT GGT GCAGGCAAT AT CCT GAAGAAACAT GCAGCAGAACAAGCCAAGAAGGCC 1140 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 1698 

ATGATGACTGGTGCAGGCAATAT CCTGAAGAAACAT GCAGCAGAACAAGCCAAGAAGGCC 1757 
Qy 1141 

T CCAGC AT GAGCGAGGT GCACACC GAT GAGCCT GAGGACT TTAT T T C CAAGGT CTT CTTT 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1758 

T CCAGC AT GAGC GAGGT GCACACC GAT GAGCCT GAGGACT TTAT T T C CAAGGT CTT CTTT 1817 
Qy 1201 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1818 

GAC C CAT GTT CTT AC CAGT GCCT GGAGAACT GT GGGGCT GT ACT CCT GAC AGT GGT GAGG 1877 
Qy 1261 

AAAGGGGGAGACAT GT CAAAGAC CAT GT AT GT GGACT ACAAAACAGAGGAT GGTT CT GC C 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1878 

AAAGGGGGAGACAT GTCAAAGAC CAT GT AT GT GGACT ACAAAACAGAGGAT GGTT CT GCC 1937 
Qy 1321 

AAT GCAGGGGCT GACT AT GAGTT CAC AGAGGGCACGGT GGTTCT GAAGCCAGGAGAGACC 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1938 

AATGCAGGGGCT GAC TAT GAGT T CACAGAGGGC ACGGT GGTT CT GAAGCCAGGAGAGACC 1997 
Qy 1381 

C AGAAGGAGTT CT C C GT GGGCATAATT GAT GACGAC ATTTTT GAGGAGGAT GAACACT T C 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1998 

CAGAAGGAGTT CT CC GT GGGCATAATT GAT GAC GAC ATTTT T GAGGAGGAT GAACACT T C 2057 
Qy 1441 

TTTGTAAGGTT GAGCAAT GT CC GCAT AGAGGAGGAGCAGC C AGAGGAGGGGAT GC CT CCA 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2058 

TTTGTAAGGTT GAGCAAT GT CC GCAT AGAGGAGGAGCAGCCAGAGGAGGGGAT GCCT C CA 2117 
Qy 1501 



GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I 1 I I I I I 
Db 2118 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 2177 
Qy 1561 

GTT ACC ATCTT GGAT GAT GACC AT GC AGGC AT CTT C ACTTTTGAAT GTGAT ACT AT T CAT 1620 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 2178 

GTT AC C ATCTTGGATGAT GACC AT GC AGGCAT CTT CACTTTTGAATGT GAT ACT ATT CAT 2237 
Qy 1621 

GT CAGT GAGAGT ATT GGT GTT AT GGAGGT CAAGGT TCT GCGGAC AT CAGGTGC CCGGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I 
Db 2238 

GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 2297 
Qy 1681 

AC AGT CAT CGTC C CCT TT AGGAC AGT AGAAGGGACAGCCAAGGGTGGCGGT GAGGACTT T 1740 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2298 

AC AGT CAT C GT C CCCTTT AGGACAGT AGAAGGGACAGCCAAGGGT GGCGGT GAGGACTT T 2357 
Qy 1741 

GAAGACACATATGGGGAGTT GGAATT CAAGAATGATGAAACTGTGAAAACCATAAGGGTT 1800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2358 

GAAGACACATATGGGGAGTTGGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 2417 
Qy 1801 

AAAATAGT AGAT GAGGAGGAAT AC GAAAGGCAAGAGAATTT CT T CATT GCC CTT GGT GAA 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2418 

AAAAT AGTAGAT GAGGAGGAAT AC GAAAGGCAAGAGAATTT CTT CATTGC CCT T GGT GAA 2477 
Qy 1861 

C C GAAAT GGAT GGAACGT GGAAT AT C AGAT GT GACAGACAGGAAGCT GACT AT GGAAGAA 1920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2478 

CC GAAAT GGAT GGAAC GT GGAAT AT C AGAT GT GACAGACAGGAAGCT GACT AT GGAAGAA 2537 
Qy 1921 

GAGGAGGCCAAGAGGAT AGCAGAGAT GGGAAAGC C AGT AT T GGGTGAACACC C CAAACT A 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I. I I I I I I I I I I I I I I I I I I I 
Db 2538 

GAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTGGGTGAACACCCCAAACTA 2597 
Qy 1981 

GAAGT CAT CAT T GAAGAGT C CT AT GAGT T CAAGACT AC GGT GGACAAACT GAT CAAGAAG 2040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2598 



GAAGT CAT CAT T GAAGAGT CCT AT GAGTTCAAGACTAC GGT GGACAAACTGAT CAAGAAG 2657 
Qy 2041 

ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 2100 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 2658 

ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 2717 
Qy 2101 

ACCGTCAGT GCAGCAGGGGAT GAGGAT GAGGAT GAATCCGGGGAGGAGAGGCTGCCCTCC 2160 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 2718 

AC CGT CAGT GCAGCAGGGGAT GAGGAT GAGGAT GAAT C CGGGGAGGAGAGGCT GC CCT CC 2777 
Qy 2161 

TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 2220 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II 
Db 2778 

TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 2837 
Qy 2221 

CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 2280 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 2838 

CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 2897 
Qy 2281 

ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 2340 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 2898 

ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 2957 
Qy 2341 

GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 2400 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 2958 

GATT CAGT CACAGCT GTT GT TT T C GTGGCATTT GGCACCT CT GT CCC AGAT AC GTTT GCC 3017 
Qy 2401 

AGCAAAGCT GCT GC C CT C C AGGAT GT AT AT GCAGAC GCCT C CAT T GGCAAC GT GAC GGGC 24 60 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 3018 

AGCAAAGCT GCT GC CCT C CAGGATGT AT AT GCAGACGCCTCC ATT GGCAAC GT GAC GGGC 3077 
Qy 2461 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 2520 

I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I 
Db 3078 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 3137 
Qy 2521 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2580 



1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 3138 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 3197 
Qy 2581 

CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 2640 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3198 

CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 3257 
Qy 2641 

CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 2700 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 3258 

CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 3317 
Qy 2701 

AGCCT GTGGCTCCT CT ACAT ACT CT TT GCCAC ACT AGAGGC CT AT T GCTACAT CAAGGGG 2760 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3318 

AGCCT GT GGCT CCT CT ACAT ACTCT TT GCCAC ACT AGAGGC CT ATT GCTACAT CAAGGGG 3377 

Qy 2761 TTCTAA 2766 

I I I I I I 

Db 3378 TTCTAA 3383 



RESULT 3 

US-09-804-474A-1 

Sequence 1, Application US/09804474A 
Patent No. US20020119518A1 
GENERAL INFORMATION: 
APPLICANT: KODET, Stefan et al 

TITLE OF INVENTION: ISOLATED HUMAN TRANSPORTER PROTEINS, 
TITLE OF INVENTION: NUCLEIC ACID MOLECULES ENCODING HUMAN 
TRANSPORTER PROTEINS, 

TITLE OF INVENTION: AND USES THEREOF 
FILE REFERENCE: CL000891 

CURRENT APPLICATION NUMBER: US/09/804 , 474A 
CURRENT FILING DATE: 2001-03-13 
NUMBER OF SEQ ID NOS : 4 

SOFTWARE: Fast SEQ for Windows Version 4.0 
SEQ ID NO 1 
LENGTH: 2782 
TYPE: DNA 
ORGANISM: Human 
US-09-804-474A-1 

Query Match 99.8%; Score 2761.2; DB 9; Length 2782; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2763; Conservative 0; Mismatches 3; Indels 0; 
Gaps 0; 



Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 10 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 69 
Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 70 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 129 
Qy 121 

ACAGGGCAGAACAAT GAGT CCT GTT CAGGGT CAT C GGACTGCAAGGAGGGT GT C ATC CTG 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 130 

ACAGGGCAGAACAAT GAGT C CT GTT CAGGGT CAT C GGACT GCAAGGAGGGT GT CATCCTG 189 
Qy 181 

C CAAT CT GGT AC C C GGAGAACC CTT C CCT TGGGGACAAGATT GC CAGGGT C ATT GT CT AT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 190 

C CAAT CT GGT AC CC GGAGAACC CTT C CCT T GGGGACAAGATT GCCAGGGT C ATT GT CTAT 249 
Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 250 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 309 
Qy 301 

T CTAT T GAAGT CAT CACCT CTCAAGAGAGGGAGGT GACAAT TAAGAAAC CCAAT GGAGAA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 310 

TCTATT GAAGT CATCACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAAT GGAGAA 369 
Qy 361 

ACCAGCACAACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 370 

AC C AGC ACAACC ACT AT TC GGGT CT GGAAT GAAACT GT CT CCAAC CTGAC CCTT ATGGCC 429 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 430 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 489 
Qy 481 

AT T GCT GGT GAT CT GGGAC CTT CTAC C ATTGT AGGGAGT GCAGC CTT CAACATGTTCATC 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 490 

ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 549 



Qy 541 

AT CATTGGCATCT GTGTCT ACGTGATCCCAGACGGAGAGACT CGCAAGATCAAGCATCTA 600 

I I I I II I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 550 

ATCATTGGCATCT GTGTCT ACGTGATCCCAGACGGAGAGACTCGCAAGAT CAAGCATCT A 609 
Qy 601 

CGAGT CTTCT T CAT CAC C GCT GCTT GGAGT AT CT T T GC CT ACAT CT GGCT CT AT ATGAT T 660 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I 
Db 610 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 669 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 670 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 729 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 730 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 789 
Qy 781 

ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTAT CATAGAGACAGAGGGT GAC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 790 

AT GCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGT GAC 849 
Qy 841 

CACCCTAAGGGCATT GAGATGGATGGGAAAATGATGAATT CCCATTTT CTAGATGGGAAC 900 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 850 

CACCCTAAGGGCATT GAGATGGATGGGAAAATGATGAATTCCCATTTT CTAGATGGGAAC 909 
Qy 901 

CT GGT GC CCCT GGAAGGGAAGGAAGT GGAT GAGT C CC GCAGAGAGAT GAT CCGGATT CT C 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 910 

CT GGT GC C CCT GGAAGGGAAGGAAGT GGAT GAGT C CC GCAGAGAGAT GAT C C GGAT C CT C 969 
Qy 961 

AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 1020 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 970 

AAGGAT CT GAAGCAAAAACAC CCAGAGAAGGACT T AGAT CAGCT GGT GGAGAT GGCCAAT 1029 
Qy 1021 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 1030 • 

T ACT AT GCT CTTT CC CAC CAACAGAAGAGC CGC GC CTT CTACCGTAT CCAAGCCACTC GT 1089 
Qy 1081 

AT GATGACT GGT GC AGGCAATAT CCT GAAGAAAC AT GCAGCAGAACAAGC CAAGAAGGC C 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 1090 

AT GATGACT GGT GCAGGCAAT AT CCT GAAGAAAC AT GCAGCAGAACAAGCCAAGAAGGC C 1149 
Qy 1141 

T CCAGCATGAGC GAGGTGCACACCGAT GAGC CTGAGGACTTTATTT CCAAGGT CTT CT TT 1200 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I 
Db 1150 

T CCAGCAT GAGC GAGGTGC ACACC GAT GAGC CT GAGGACTTTAT TT CCAAGGT CT T CTTT 1209 
Qy 1201 

GACC CAT GTT CTT ACC AGT GC CT GGAGAACT GT GGGGCTGTACT CCT GACAGT GGT GAGG 1260 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1210 

GACC CAT GTT CTT AC CAGT GC CTGGAGAACT GT GGGGCTGTACT CCT GACAGT GGT GAGG 1269 
Qy 1261 

AAAGGGGGAGAC AT GT CAAAGAC CAT GT AT GT GGACT ACAAAACAGAGGAT GGT T CT GCC 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1270 

AAAGGGGGAGAC AT GT CAAAGAC CAT GT AT GT GGACTACAAAACAGAGGAT GGTT CT GCC 1329 
Qy 1321 

AAT GC AGGGGCT GACT AT GAGTT CAC AGAGGGC AC GGT GGT T CT GAAGC CAGGAGAGACC 1380 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1330 

AAT GC AGGGGCT GACT AT GAGTT CAC AGAGGGCAC GGT GGT TCT GAAGC CAGGAGAGACC 1389 
Qy 1381 

CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 1440 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1390 

CAGAAGGAGTTCTCCGTGGGCATAATTGATGACGACATTTTTGAGGAGGATGAACACTTC 1449 
Qy 1441 

TTT GTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGAT GCCTCCA 1500 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I. 
Db 1450 

TTT GT AAGGT T GAGCAAT GT C CG CAT AGAGGAGGAGCAGCCAGAGGAGGGGAT GC CT CCA 1509 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1510 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1569 
Qy 1561 



GTTACCATCTTGGATGATGACCATGCAGGCATCTTCACTTTTGAATGTGATACTATTCAT 1620 

I I I I I I I I I 1 I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1570 

GTT ACCATCTTGGAT GAT GACCAT GCAGGC AT CT T CACTTTTGAAT GTGATACT AT T CAT 1629 
Qy 1621 

GTCAGT GAGAGTATTGGT GTTATGGAGGTCAAGGTTCT GCGGACATCAGGTGCCCGGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1630 

GT CAGT GAGAGT ATT GGT GTT ATGGAGGT CAAGGTT CT GC GGAC AT CAGGT GC C CGGGGT 1689 
Qy 1681 

ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1690 

ACAGTCATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 1749 
Qy 1741 

GAAGACACATATGGGGAGTT GGAATTCAAGAAT GATGAAACTGTGAAAACCATAAGGGTT 1800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1750 

GAAGACACATATGGGGAGTT GGAATT CAAGAAT GAT GAAACTGTGAAAACCATAAGGGTT 1809 
Qy 1801 

AAAAT AGTAGAT GAGGAGGAAT AC GAAAGGCAAGAGAATTT CTT CATT GCCCTTGGTGAA 1860 

I I I I I I I I II I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 1810 

AAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATTTCTTCATTGCCCTTGGTGAA 1869 
Qy 1861 

C CGAAAT GGAT GGAACGT GGAAT AT CAGAT GT GACAGACAGGAAGCT GACTAT GGAAGAA 1920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I 
Db 1870 

CC GAAAT GGATGGAAC GT GGAAT AT CAGAT GT GACAGACAGGAAGCT GACTAT GGAAGAA 1929 
Qy 1921 

GAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTGGGTGAACACCCCAAACTA 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1930 

GAGGAGGC CAAGAGGAT AGCAGAGATGGGAAAGCC AGT AT T GGGT GAAC ACC CCAAACT G 1989 
Qy 1981 

GAAGTCATCATTGAAGAGT CCT AT GAGTTCAAGACTACGGTGGACAAACTGATCAAGAAG 2040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II II I I I I I I I I I I I I I I I I I 
Db 1990 

GAAGTCATCATTGAAGAGT CCTAT GAGTTCAAGACTACGGT GGACAAACTGATCAAGAAG 2049 
Qy 2041 

ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 2100 

I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 2050 



ACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCATGGAGGCCATC 2109 
Qy 2101 

ACCGTCAGTGCAGCAGGGGAT GAGGATGAGGATGAATCCGGGGAGGAGAGGCT GCCCTCC 2160 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 2110 

ACCGT CAGT GCAGCAGGGGAT GAGGAT GAGGAT GAAT CCGGGGAGGAGAGGCT GC CCT CC 2169 
Qy 2161 

TGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTG 2220 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2170 

T GCTT TGACT AC GT CAT GCACT T CCTGACT GT CT T CT GGAAGGT GCT GTT T GCCT GT GTG 2229 
Qy 2221 

CCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGC 2280 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2230 

CCCCCCACAGAGTACTGCCACGGCTGGGCCT GCTT CGCCGTCTCCATCCT CAT CATTGGC 2289 
Qy 2281 

ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 2340 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2290 

ATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAA 2349 
Qy 2341 

GATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 2400 

Mill 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2350 

GATTCGGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCC 2409 
Qy 2401 

AGCAAAGCT GCT GC C CT CCAGGAT GT AT AT GC AGAC GCCT C CAT T GGCAACGT GAC GGGC 2460 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2410 

AGCAAAGCT GCT GCCCTCCAGGATGT AT AT GCAGACGCCTCCATTGGCAACGTGACGGGC 2469 
Qy 2461 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 2520 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 2470 

AGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTAC 2529 
Qy 2521 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2580 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 2530 

TGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACC 2589 
Qy 2581 

CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 2640 



Db 2590 

CTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCAC 
Qy 2641 

CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I 
Db 2650 

CTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTG 
Qy 2701 

AGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGG 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2710 

AGC CT GT GGCT CCT CT ACAT ACT CT T TGC C AC ACT AGAGGC CT AT T GCT ACATCAAGGGG 

Qy 2761 TTCTAA 2766 

I I I I I I 

Db 2770 TTCTAA 2775 



RESULT 4 
US-10-275-116-1 

; Sequence 1, Application US/10275116 
; Publication No. US20030096312A1 
; GENERAL INFORMATION: 
; APPLICANT: Merck Patent GmbH 
; TITLE OF INVENTION: No. US20030096312Alel natrium-calium exchanger 
protein 

; FILE REFERENCE: HNCX3CWWS 

; CURRENT APPLICATION NUMBER: US/ 10/275, 116 

; CURRENT FILING DATE: 2002-11-01 

; NUMBER OF SEQ ID NOS : 2 

; SOFTWARE: Patent In Ver. 2.1 

; SEQ ID NO 1 

LENGTH: 2781 

TYPE: DNA 
; ORGANISM: Homo sapiens 

FEATURE: 

NAME/ KEY: CDS 

LOCATION: (1) . . (2781) 
US-10-275-116-1 

Query Match 98.8%; Score 2733.4; DB 14; Length 2781; 

Best Local Similarity 99.3%; Pred. No. 0; 

Matches 2762; Conservative 0; Mismatches 1; Indels 18; 
Gaps 1; 

Qy l 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I II I II I I I I I I I I I 
Db 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 
Qy 61 

TTT GT GCTCTT CCT GAATGGTCTT CGAGCAGAGGCTGGT GGCTCAGGGGACGTGCCAAGC 120 



2649 
2700 

2709 
2760 

2769 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 
Qy 121 

ACAGGGCAGAACAATGAGTCCT GTT CAGGGTCAT CGGACTGCAAGGAGGGTGTCATCCTG 180 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 

ACAGGGCAGAACAAT GAGT CCT GTT CAGGGT CAT CGGACT GCAAGGAGGGT GT CAT CCTG 180 
Qy 181 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 

CCAAT CT GGT AC CC GGAGAAC C CT T C C CTT GGGGACAAGAT T GCCAGGGT C ATT GT CT AT 240 
Qy 241 

TTTGT GGCCCT GAT AT ACAT GT T C CTT GGGGT GT C CAT CAT TGCT GACCGCTT CAT GGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 
Qy 301 

TCTATTGAAGT CATCACCT CTCAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 

T CTAT T GAAGT CAT CACCT CT CAAGAGAGGGAGGT GACAATTAAGAAACCCAAT GGAGAA 360 
Qy 361 

AC CAGC ACAACCACT AT T CGGGT CT GGAAT GAAACTGT CT CCAAC CT GACCCT T AT GGCC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 

AC CAGCACAAC C ACT AT T CGGGT CT GGAAT GAAACT GT CT CCAACCT GACC CTT AT GGCC 420 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 
Qy 481 

ATTGCTGGTGAT CT GGGACCTT CT ACCATTGTAGGGAGTGCAGCCTT CAACAT GTT CATC 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 

ATTGCTGGT GAT CT GGGACCTT CT AC C ATT GTAGGGAGT GCAGCCTT CAACAT GTT CAT C 540 
Qy 541 

AT CATT GGC AT CT GT GT CT AC GT GAT CC CAGACGGAGAGACT C GCAAGAT CAAGCAT CT A 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 

ATCATTGGCAT CT GTGTCT ACGT GATCCCAGACGGAGAGACTCGCAAGATCAAGCATCT A 600 



Qy 601 

CGAGT CT T CTT CAT C ACCGCT GCTT GGAGT AT CTTTGC CT ACAT CT GGCT CT AT AT GATT 660 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 

Db 601 

C GAGT CT T CTT CAT C ACC GCT GCTT GGAGT AT CT TTGC CT AC AT CT GGCT CT AT AT GAT T 660 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 
Qy 781 

AT GCACAAAAAGT AC CGC ACAGACAAACACC GAGGAATT AT CAT AGAGACAGAGGGT GAC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 781 

AT GCACAAAAAGT ACCGCACAGACAAACACCGAGGAATTAT CAT AGAGACAGAGGGT GAC 840 
Qy 841 

CACCCTAAGGGCATTGAGAT GGATGGGAAAAT GATGAATT CCCATTTTCTAGATGGGAAC 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 841 

CACCCTAAGGGCATTGAGAT GGATGGGAAAAT GATGAATTCCCATTTTCTAGATGGGAAC 900 
Qy 901 

CT GGT GCCCCT GGAAGGGAAGGAAGT GGAT GAGT CCC GCAGAGAGAT GAT C CGGATT CTC 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 

CT GGT GCCCCT GGAAGGGAAGGAAGT GGAT GAGT CCCGCAGAGAGAT GAT C CGGATT CT C 960 
Qy 961 

AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCTGGTGGAGATGGCCAAT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 961 

AAGGAT CT GAAGCAAAAACACC C AGAGAAGGACTT AGAT CAGCT GGT GGAGAT GGC CAAT 1020 
Qy 1021 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 

T ACT AT GCT CTTT CC CAC CAACAGAAGAGC C GT GC CT T CT ACC GT AT C CAAGCCACT CGT 1080 
Qy 1081 

AT GAT GACTGGT GC AGGCAAT AT C CTGAAGAAACAT GC AGCAGAACAAGCCAAGAAGGCC 1140 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1081 

AT GATGACT GGT GCAGGCAAT AT C CT GAAGAAACAT GCAGC AGAACAAGCCAAGAAGGC C 1140 
Qy 1141 

T CCAGCAT GAGC GAGGT GC ACAC CGAT GAGC CT GAGGACTTT AT TT C CAAGGTCTT CTT T 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I 
Db 1141 

T C C AGCAT GAGC GAGGT GCACACCGAT GAGC CT GAGGACT TT AT TT C CAAGGTCTT CTT T 1200 
Qy 1201 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 



Db 1201 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 
Qy 1261 

AAAGGGGGAGACAT GT CAAAGAC CAT GT AT GTGGACTACAAAACAGAGGAT GGTT CT GC C 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1261 

AAAGGGGGAGACAT GT CAAAGAC CAT GT AT GT GGACTACAAAACAGAGGAT GGTT CT GCC 1320 
Qy 1321 

AAT GCAGGGGCTGACTAT GAGTTCACAGAGGGCACGGT GGTTCT GAAGCCAGGAGAGACC 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 

AATGCAGGGGCT GACTAT GAGTT CACAGAGGGCACGGTGGTT CT GAAGCCAGGAGAGACC 1380 
Qy 1381 

CAGAAGGAGTTCTCCGT GGGCATAATTGAT GACGACATTTTT GAGGAGGATGAACACTT C 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 1381 

C AGAAGGAGTT CT CCGT GGGCAT AATT GAT GAC GACAT TT TTGAGGAGGAT GAAC ACTT C 1440 
Qy 1441 

TTTGTAAGGTT GAGCAAT GTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 1441 

T TT GTAAGGTT GAGCAAT GT CC GCAT AGAGGAGGAGCAGC CAGAGGAGGGGAT GCCT CC A 1500 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 
Qy 1561 

GTTACCAT CTT GGATGAT GACCATGCAGGCATCTTCACTTTTGAATGTGAT ACTATTCAT 1620 



Db 1561 

GT T AC CAT CTT GGAT GAT GAC CAT GCAGGCAT CT T CACT T TT GAAT GT GAT ACT ATT CAT 1620 



Qy 1621 

GT C AGT GAGAGT AT TGGT GTTAT GGAGGT CAAGGTT CT GC GGACAT CAGGT GCCCGGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 

GT CAGT GAGAGT ATT GGT GTTAT GGAGGT CAAGGT T CT GCGGACAT CAGGT GCC CGGGGT 1680 
Qy 1681 

ACAGTCAT C GT CCCCTTT AGGACAGT AGAAGGGACAGCCAAGGGT GGCGGT GAGGACTT T 1740 

I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1681 

ACAGT CAT CGT C CC CTT T AGGACAGT AGAAGGGACAGC CAAGGGT GGC GGT GAGGACTTT 1740 
Qy 1741 

GAAGAC ACAT AT GGGGAGTT GGAATTCAAGAAT GAT GAAACT GT GAAAACCATAAGGGT T 1800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1741' 

GAAGACACATAT GGGGAGTT GGAATTCAAGAAT GAT GAAACTGT GAAAACCATAAGGGTT 1800 
Qy 1801 

AAAAT AGT AGAT GAGGAGGAAT AC GAAAGGCAAGAGAATT T CTT CAT T GCC CTT GGT GAA 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1801 

AAAAT AGT AGAT GAGGAGGAAT AC GAAAGGCAAGAGAAT TT CT T CATT GCCCTT GGT GAA 1860 

Qy 1861 CCGAAATGGAT GGAACGTGGAATATC 

AGAT GT GACAGAC AGG 1902 

I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I 
Db 1861 

CCGAAAT GGATGGAACGT GGAATAT CAGGTGTGAGATTCTTTAAAGATGTGACAGACAGG 1920 
Qy 1903 

AAGCT GACT AT GGAAGAAGAGGAGGCCAAGAGGAT AGC AGAGAT GGGAAAGCCAGT AT T G 1962 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1921 

AAGCT GACT AT GGAAGAAGAGGAGGCCAAGAGGAT AGCAGAGAT GGGAAAGCCAGT ATT G 1980 
Qy 1963 

GGT GAACACCCCAAACTAGAAGTCATCATT GAAGAGTCCTATGAGTT CAAGACTACGGT G 2022 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 1981 

GGT GAAC AC CC CAAACT AGAAGT C ATCATT GAAGAGT C CT ATGAGTT CAAGACT AC GGT G 2040 
Qy 2023 

GACAAACT GAT CAAGAAGACAAACCT GGC CTT GGTT GT GGGGAC C C ATT CCT GGAGGGAC 2082 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2041 

GACAAACT GAT CAAGAAGACAAACCTGGC CTTGGTT GT GGGGACC C ATT CCTGGAGGGAC 2100 
Qy 2083 

CAGTT C AT GGAGGCCAT C AC C GT C AGT GCAGCAGGGGAT GAGGAT GAGGAT GAAT CCGGG 2142 
I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 2101 

CAGTT C ATGGAGGCCAT C ACC GT C AGT GCAGCAGGGGATGAGGATGAGGAT GAAT C CGGG 2160 
Qy 2143 

GAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAG 2202 

I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2161 

GAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAG 2220 
Qy 2203 

GTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTC 2262 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 2221 

GTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTC 2280 
Qy 2263 

TCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGC 2322 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 2281 

TCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGC 2340 
Qy 2323 

TGCACCATT GGT CTCAAAGATT CAGT CACAGCTGTTGTTTTCGT GGCATTTGGCACCT CT 2382 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 2341 

TGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCT 2400 
Qy 2383 

GT C C C AGAT ACGT TT GCC AGCAAAGCT GCT GC C CT CCAGGAT GT AT AT GCAGAC GC CT C C 2442 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2401 

GTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATGTATATGCAGACGCCTCC 2460 
Qy 2443 

ATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGG 2502 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2461 

ATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGG 2520 
Qy 2503 

TCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACA 2562 

I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2521 

TCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACA 2580 
Qy 2563 

CTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTG 2 622 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2581 

CTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTG 2640 
Qy 2623 



TACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCC 2682 



I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 2641 

TACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCC 2700 
Qy 2683 

ACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCC 2742 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I 
Db 2701 

ACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCC 2760 

Qy 2743 T ATT GCT ACAT CAAGGGGTT C 2763 

I I I I I I II I I I I I I II I I I I I 
Db 2761 T ATT GCT ACAT CAAGGGGTT C 2781 



RESULT 5 
US-10-114-153-5 

Sequence 5, Application US/10114153 
Publication No. US20030185815A1 
GENERAL INFORMATION: 



Padigaru, Muralidhara 
Shenoy, Suresh 
Ke kuda , Rame s h 
Rastelli, Luca 
Mezes, Peter 
Smithson, Glennda 
Guo, Xiaojia 
Gerlach, Valerie 
Casman, Stacie 
Boldog, Ferenc 
Li, Li 

Zerhusen, Bryan 
Tchernev, Velizar 
Gangolli, Esha 
Vernet, Corine 
Spytek, Kimberly 
Malyankar, Uriel 
Patturajan, Meera 
Miller, Charles 
Taupier, Raymond 
Heyes, Melvyn 
Ju, Jingfang 
Peyman, John 
Catterton, Elina 
MacDougall, John 
Edinger, Shlomit 
Stone, David 
Ann 

NOVEL ANTIBODIES THAT BIND TO ANTIGENIC 
POLYPEPTIDES, NUCLEIC ACIDS 

TITLE OF INVENTION: ENCODING THE ANTIGENS, AND METHODS OF USE 
FILE REFERENCE: 21402-322A 

CURRENT APPLICATION NUMBER: US/10/114,153 
CURRENT FILING DATE: 2002-08-06 
PRIOR APPLICATION NUMBER: 60/281086 
PRIOR FILING DATE: 2001-04-03 
PRIOR APPLICATION NUMBER: 60/281906 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Jr. 



Mazur, 



TITLE OF INVENTION: 



; PRIOR FILING DATE: 2001-04-05 

PRIOR APPLICATION NUMBER: 60/282020 
; PRIOR FILING DATE: 2001-04-06 
; PRIOR APPLICATION NUMBER: 60/282930 
; PRIOR FILING DATE: 2001-04-10 
; PRIOR APPLICATION NUMBER: 60/283512 
; PRIOR FILING DATE: 2001-04-12 
; PRIOR APPLICATION NUMBER: 60/283444 
; PRIOR FILING DATE: 2001-04-12 
; PRIOR APPLICATION NUMBER: 60/283657 
; PRIOR FILING DATE: 2001-04-13 
; PRIOR APPLICATION NUMBER: 60/283710 
; PRIOR FILING DATE: 2001-04-13 
; PRIOR APPLICATION NUMBER: 60/283678 
; PRIOR FILING DATE: 2001-04-13 
; PRIOR APPLICATION NUMBER: 60/284234 
; PRIOR FILING DATE: 2001-04-17 

; Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS : 251 
; SEQ ID NO 5 

LENGTH: 2685 

TYPE: DNA 
; ORGANISM: Homo sapiens 

FEATURE: 

NAME/ KEY: CDS 

LOCATION: (1) ■ . (2685) 
US-10-114-153-5 

Query Match 96.6%; Score 2673.2; DB 15; Length 2685; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 2675; Conservative 0; Mismatches 3; Indels 0; 
Gaps 0; 

Qy 86 

GAGCAGAGGCT GGT GGCTCAGGGGACGT GCCAAGCACAGGGCAGAACAAT GAGT CCTGTT 145 
I I I 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 

GAT CC GAGGCT GGT GGCT CAGGGGAC GT GC CAAGC ACAGGGCAGAACAAT GAGT C CT GTT 61 
Qy 146 

CAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTGCCAATCTGGTACCCGGAGAACCCTT 205 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 62 

CAGGGT CAT CGGACTGCAAGGAGGGT GT C ATCCT GCCAAT CTGGT ACCC GGAGAACC CTT 121 
Qy 206 

C C CT T GGGGACAAGAT T GC CAGGGT CATT GTCT ATTTT GT GGC C CT GAT AT ACAT GTTCC 265 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 122 

CCCTTGGGGACAAGATTGCCAGGGTCATTGTCTATTTTGTGGCCCTGATATACATGTTCC 181 
Qy 266 

TT GGGGT GT CCAT CAT T GCT GAC CGCT T C ATGGCAT CTAT T GAAGT CAT CAC CT CT CAAG 325 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 182 

T T GGGGT GT CCAT CAT T GCT GAC C GCTT CATGGCAT CT ATT GAAGT CAT CAC CT CT CAAG 241 



Qy 326 

AGAGGGAGGT GACAATTAAGAAAC C CAAT GGAGAAACCAGCACAAC C ACT ATT CGGGTCT 385 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 242 

AGAGGGAGGT GACAAT TAAGAAAC C CAAT GGAGAAACCAGCACAACC ACT ATT C GGGT CT 301 
Qy 386 

GGAAT GAAACTGT CT C CAACCT GACC CT TAT GGC CCT GGGT T C CTCT GCT CCT GAGAT AC 445 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II 
Db 302 

GGAAT GAAACTGT CT C CAACCT GAC C CTT AT GGCC CT GGGTTCCTCT GCT C CT GAGAT AC 361 
Qy 446 

TCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGATCTGGGACCTTCTA 505 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 362 

TCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGATCTGGGACCTTCTA 421 
Qy 506 

CCAT TGT AGGGAGTGCAGCCTT CAAC AT GTT CAT CAT CATT GGC AT CTGT GT CT AC GT GA 565 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 422 

CCATT GT AGGGAGT GC AGC CTT CAACAT GTT CAT CAT CATT GGCAT CT GT GT CT ACGT GA 481 
Qy 566 

T C CCAGAC GGAGAGACT CGCAAGAT CAAGCAT CT AC GAGT CTT CTT CAT C ACC GCT GCTT 62 5 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 482 

T CC CAGACGGAGAGACTCGCAAGAT CAAAC AT CT AC GAGT CTT CTT CAT CACCGCT GCTT 541 
Qy 626 

GGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGG 685 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 542 

GGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGG 601 
Qy 686 

TCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCT 745 



Db 602 

TCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCT" 661 
Qy 746 

GGGT GGCAGATAAAC GACT GCT CT T CT ACAAAT ACAT GCACAAAAAGT ACCGCACAGAC A 805 

I I I I I I I I I II I I I I I I I I I I II I II I I I I I I I I I I.I I I I I I I I II I II I I I I I I I I I I I 

Db 662 

GGGT GGC AGATAAAC GACT GCT CT T CT ACAAAT ACAT GCACAAAAAGT AC CGCACAGAC A 721 
Qy 806 

AAC AC CGAGGAATT AT CAT AGAGAC AGAGGGT GAC CACCCTAAGGGCAT T GAGAT GGAT G 865 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 722 

AACACC GAGGAATT AT CAT AGAGACAGAGGGTGAC CACCCTAAGGGCATTGAGAT GGAT G 781 
Qy 866 

GGAAAAT GAT GAATTC CCATTTT CT AGAT GGGAAC CT GGT GC CC CT GGAAGGGAAGGAAG 925 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 782 

GGAAAAT GAT GAATTC CCATTTT CT AGAT GGGAAC CT GGT GC CC CT GGAAGGGAAGGAAG 841 
Qy 926 

TGGAT GAGTCCCGCAGAGAGATGAT C C GGATT CT CAAGGAT CTGAAGCAAAAAC ACCCAG 985 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 842 

T GGAT GAGT CC CGCAGAGAGATGAT CC GGATT CT CAAGGAT CTGAAGCAAAAACACCCAG 901 
Qy 986 

AGAAGGACTTAGAT CAGCT GGTGGAGATGGCCAATTACTATGCT CTTT CCCACCAACAGA 1045 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 902 

AGAAGGACTTAGAT CAGCT GGT GGAGAT GGC CAATTACT AT GCT CT TT CCCACCAACAGA 961 
Qy 1046 

AGAGC C GC GCCTT CTACC GT AT C CAAGCCACTCGT AT GAT GACT GGT GC AGGCAAT AT C C 1105 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 962 

AGAGCC GCGC CTT CTACCGT AT C CAAGCCACTCGT AT GAT GACT GGT GC AGGCAAT AT C C 1021 
Qy 1106 

TGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCCTCCAGCATGAGCGAGGTGCACACCG 1165 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1022 

T GAAGAAACATGCAGCAGAACAAGCCAAGAAGGC CTC CAGCAT GAGCGAGGT GCAC AC CG 1081 
Qy 1166 

ATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTTGACCCATGTTCTTACCAGTGCCTGG 1225 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 1082 

ATGAGCCTGAGGACTTTATTTCCAAGGTCTTCTTTGACCCATGTTCTTACCAGTGCCTGG 1141 
Qy 1226 

AGAACT GT GGGGCT GT ACT C CT GAC AGT GGT GAGGAAAGGGGGAGACAT GT CAAAGACCA 1285 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1142 

AGAACT GT GGGGCT GT ACT CCT GACAGT GGT GAGGAAAGGGGGAGACAT GT CAAAGACCA 1201 
Qy 1286 

T GT AT GT GGACT ACAAAAC AGAGGAT GGTT CT GC CAAT GC AGGGGCT GACT AT GAGTT CA 1345 



Db 1202 

T GT AT GT GGACT ACAAAAC AGAGGAT GGT T CT GCCAAT GCAGGGGCT GACT AT GAGTT C A 1261 



Qy 1346 

CAGAGGGC AC GGT GGTT CT GAAGCCAGGAGAGAC CCAGAAGGAGTT CT CCGT GGGCATAA 1405 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1262 

CAGAGGGC AC GGT GGTT CT GAAGCCAGGAGAGAC C CAGAAGGAGTT CT C CGT GGGCATAA 1321 
Qy 1406 

T T GAT GACGACATTTTT GAG GAGGAT GAACACT T CTT T GTAAGGTT GAGCAATGT CCGCA 1465 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II II I I I I I I 
Db 1322 

T TGAT GACGACATTTTT GAGGAGGAT GAACACT T CTT T GTAAGGTT GAGCAATGT CCGCA 1381 
Qy 1466 

T AGAGGAGGAGC AGCCAGAGGAGGGGAT GC CT C C AGCAAT ATT CAACAGT CTTC C CTT GC 1525 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1382 

TAGAGGAGGAGCAGCCAGAGGAGGGGAT GCCT C CAGCAAT ATT CAACAGT CTTCC CTT GC 1441 
Qy 1526 

CTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGATGATGACCATG 1585 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1442 

CTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGATGATGACCATG 1501 
Qy 1586 

CAGGCATCTT CACTTTTGAATGT GAT ACT ATT CATGTCAGT GAGAGTATTGGTGTTATGG 1645 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 1502 

CAGGCAT CTT CACTTT T GAATGT GAT ACT AT T CAT GT CAGT GAGAGT ATTGGTGT TAT GG 1561 
Qy 1646 

AGGT CAAGGT T CT GCGGACAT C AGGT GC C CGGGGT AC AGT CAT CGT CC C CTTT AGGACAG 1705 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1562 

AGGT CAAGGT T CT GCGGACATC AGGT GC CCGGGGT ACAGT CAT C GT CC C CT TT AGGACAG 1621 
Qy 1706 

TAGAAGGGAC AGCCAAGGGT GGC GGT GAGGACTTTGAAGACACAT ATGGGGAGTT GGAAT 1765 

I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 1622 

TAGAAGGGACAGCCAAGGGTGGCGGT GAGGACTTTGAAGACACAT ATGGGGAGTT GGAAT 1681 
Qy 1766 

T CAAGAAT GAT GAAACT GT GAAAAC C ATAAGGGT T AAAAT AGT AGAT GAGGAGGAAT ACG 1825 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1682 

T CAAGAAT GAT GAAACT GT GAAAACC AT AAGGGTTAAAAT AGT AGATGAGGAGGAAT ACG 1741 
Qy 1826 

AAAGGCAAGAGAATTT CTT CAT T GC C CTT GGT GAAC CGAAAT GGAT GGAAC GT GGAAT AT 1885 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 1742 

AAAGGCAAGAGAATTT CTT C ATT GCC CT T GGT GAACCGAAATGGAT GGAACGT GGAAT AT 1801 
Qy 1886 

CAGAT GT GACAGACAGGAAGCT GACT AT GGAAGAAGAGGAGGCCAAGAGGAT AGCAGAGA 1945 

I I I II I I I I I I I II I I I I I I I II II I I I I I I I I II II I I I I I I 1 I I I I I I I I I I I I I I I I 
Db 1802 

CAGATGT GACAGACAGGAAGCT GACT AT GGAAGAAGAGGAGGCCAAGAGGAT AGCAGAGA 1861 
Qy 1946 

T GGGAAAGCCAGT AT T GGGT GAACACC C CAAACT AGAAGT CAT CATT GAAGAGT CCT AT G 2005 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1862 

T GGGAAAGCCAGT AT T GGGT GAACACC C CAAACT AGAAGT CAT C ATT GAAGAGTCCT AT G 1921 
Qy 2006 

AGTT CAAGACTAC GGT GGACAAACT GAT CAAGAAGACAAACCT GGCCTT GGT T GT GGGGA 2065 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1922 

AGTT CAAGACTAC GGT GGACAAACT GAT CAAGAAGACAAACCT GGCCTT GGTT GT GGGGA 1981 
Qy 2066 

CCCATT CCTGGAGGGACCAGTT CAT GGAGGCCAT CACCGT CAGT GCAGCAGGGGATGAGG 2125 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1982 

CCCATT C CT GGAGGGAC CAGT T CAT GGAGGCCAT CACCGT CAGT GC AGCAGGGGAT GAGG 2041 
Qy 2126 

AT GAGGAT GAAT C CGGGGAGGAGAGGCT GC C CT CCT GCTTT GACTACGT CAT GCACTT CC 2185 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2042 

AT GAGGAT GAAT C CGGGGAGGAGAGGCT GCCCT C CT GCTT T GACTAC GT CAT GCACT T CC 2101 
Qy 2186 

TGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCT 2245 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2102 

TGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCT 2161 
Qy 2246 

GGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACC 2305 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 2162 

GGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACC 2221 
Qy 2306 

T GGC CT C GCACT T CGGCT GCAC CATT GGT CT CAAAGATT CAGT CACAGCT GTT GT TTT CG 2365 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2222 

TGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCG 2281 
Qy 2366 



TGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATG 2425 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II 
Db 2282 

TGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGCAAAGCTGCTGCCCTCCAGGATG 2341 
Qy 2426 

TATATGCAGACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGG 2485 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2342 

TATATGCAGACGCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGG 2401 
Qy 2486 

GCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCC 2545 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I 
Db 2402 

GCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCC 2461 
Qy 2546 

ACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCT 2605 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2462 

ACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCT 2521 
Qy 2606 

GCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCC 2665 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2522 

GCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCC 2581 
Qy 2666 

GTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCT 2725 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 2582 

GTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCT 2641 

Qy 2726 TTGC CAC ACT AGAGGCCTATT GCT ACAT CAAGGGGT T C 2763 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2642 TTGCCACACT AGAGGCCTATT GCT ACAT CAAGGGGT T C 2679 
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Query Match 96.1%; Score 2657.6; DB 15; Length 2840; 

Best Local Similarity 97.6%; Pred. No. 0; 

Matches 2712; Conservative 0; Mismatches 54; Indels 12; 
Gaps 1; 



Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I i I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I II I 11 I I 
Db 63 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 122 
Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I I I I I I I I I I I II I I I I I I I I I ! I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 123 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 182 
Qy 121 

ACAGGGCAGAACAAT GAGT CCT GT T CAGGGTCAT C GGACT GCAAGGAGGGT GT CAT CCT G 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 183 

ACAGGGCAGAACAAT GAGT CCT GT T CAGGGTCAT C GGACT GCAAGGAGGGT GT CAT CCT G 242 
Qy 181 

CCAAT CT GGTACCC GGAGAAC C CTT CCCTT GGGGACAAGATT GCCAGGGT CATT GT CT AT 240 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 243 

C CAATCT GGTACC C GGAGAAC CCTT CCCT T GGGGACAAGAT T GCCAGGGT CATT GT CT AT 302 
Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 303 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 362 
Qy 301 

T CT ATT GAAGT CAT CAC CT CT CAAGAGAGGGAGGT GACAATTAAGAAACC CAAT GGAGAA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 363 

T CT ATT GAAGT CAT CACCT CT CAAGAGAGGGAGGT GACAATTAAGAAACC CAAT GGAGAA 422 
Qy 361 

ACC AGCACAACCACT AT T C GGGT CT GGAAT GAAACTGT CT C CAAC CTGACC CTT AT GGCC 420 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 423 

ACC AGCACAAC CAC T ATT C GGGT CT GGAAT GAAACT GT CT C CAAC CT GAC C CTT AT GGC C 482 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 483 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 542 
Qy 481 

AT T GCT GGT GAT CT GGGACCT T CT AC C ATT GT AGGGAGT GCAGCCTT CAACAT GTT CAT C 540 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 543 

ATT GCT GGT GAT CT GGGACCTT CTAC CATT GT AGGGAGT GC AGCCT T CAACATGT TC ATC 602 
Qy 541 

AT CATT GGCAT CTGTGT CTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 600 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 603 

AT CAT T GGCAT CT GTGT CT ACGT GAT CCCAGAC GGAGAGACT C GCAAGAT CAAGCAT CT A 662 
Qy 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 663 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 722 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 723 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 782 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 783 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 842 
Qy 781 

AT GC ACAAAAAGT AC C GCACAGACAAAC ACCGAGGAATT AT C AT AGAGAC AGAGGGT GAC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 843 

AT GCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGT GAC 902 
Qy 841 

CACCCTAAGGGC AT T GAGAT GGAT GGGAAAAT GAT GAAT T CC CAT TT T CT AGAT GGGAAC 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 903 

CACCCTAAGGGCATTGAGAT GGATGGGAAAAT GAT GAATT CCCAT TTT CT AGAT GGGAAC 962 
Qy 901 

CTGGTGCCCCT GGAAGGGAAGGAAGT GGAT GAGT C C C GCAGAGAGATGATCCGGATT CT C 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 963 

CT GGT GCC CCT GGAAGGGAAGGAAGT GGAT GAGT C CCGCAGAGAGAT GATCC GGATT CTC 1022 
Qy 961 

AAGGATCT GAAGCAAAAACACC CAGAGAAGGACT T AGAT CAGCT GGT GGAGATGGCCAAT 1020 



Db 1023 

AAGGAT CT GAAGCAAAAACACC CAGAGAAGGACTT AGAT CAGCT GGTGGAGATGGCCAAT 1082 



Qy 1021 

T ACTAT GCT CTTT CC CACCAACAGAAGAGCC GCGC CT T CT ACCGT AT C CAAGCCACT CGT 1080 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1083 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGTGCCTTCTACCGTATCCAAGCCACTCGT 1142 
Qy 1081 

ATGATGACTGGTGCAGGCAAT AT CCT GAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 1140 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1143 

AT GAT GACT GGT GCAGGCAAT AT C CT GAAGAAACAT GCAGCAGAACAAGC CAAGAAGGCC 1202 
Qy 1141 

T CCAGCATGAGCGAGGT GC AC AC C GAT GAGC CTGAGGACTT T ATT T C CAAGGT CTT CTTT 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I 

Db 1203 

T CCAGCATGAGC GAGGT GC AC ACC GAT GAGCCTGAGGACTT T AT TT CCAAGGT CT T CTTT 12 62 
Qy 1201 

GAC CCAT GT T CTT ACCAGT GC CT GGAGAACT GTGGGGCTGT ACT CCT GACAGT GGT GAGG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I 
Db 1263 

GAC CCAT GTT CT T AC CAGT GC CT GGAGAACT GT GGGGCT GT ACT CCT GACAGT GGT GAGG 1322 
Qy 1261 

AAAGGGGGAGACAT GT CAAAGAC CAT GT AT GT GGACT ACAAAAC AGAGGAT GGTT CT GC C 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1323 

AAAGGGGGAGACATGTCAAAGACCAT GTATGTGGACTACAAAACAGAGGATGGTTCT GCC 1382 
Qy 1321 

AAT GCAGGGGCT GACT AT GAGT T CACAGAGGGC AC GGT GGTT CT GAAGCC AGGAGAGACC 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1383 

AAT GCAGGGGCT GACT AT GAGTTCACAGAGGGCACGGT GGTTCT GAAGCCAGGAGAGACC 1442 
Qy 1381 

CAGAAGGAGTT CT CC GT GGGC AT AAT T GAT GACGACATTTTT GAGGAGGAT GAACACTT C 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1443 

CAGAAGGAGTT CT CCGT GGGCATAATT GAT GAC GACAT TTTTGAGGAGGATGAACACTT C 1502 
Qy 1441 

T T T GT AAGGT T GAGC AAT GT C CGC AT AGAGGAGGAGC AGCCAGAGGAGGGGAT GCCT CCA 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1503 

T TT GTAAGGTTGAGCAAT GT CC GCAT AGAGGAGGAGC AGC CAGAGGAGGGGAT GCCT CCA 1562 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 1563 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1622 
Qy 1561 

GTTAC CAT CTT GGAT GAT GAC CAT GCAGGCAT CTT CACTTTT GAAT GT GAT ACT ATT CAT 1620 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1623 

GTTAC CAT CTT GGAT GAT GAC CAT GCAGGCAT CTT CACTTTT GAAT GT GAT ACT ATT CAT 1682 
Qy 1621 

GT C AGT GAGAGT AT T GGT GT T AT GGAGGT CAAGGTTCT GC GGACAT CAGGT GC CCGGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I . 

Db 1683 

GT CAGT GAGAGT AT T GGT GT TAT GGAGGT CAAGGTTCT GC GGACAT CAGGT GC CCGGGGT 1742 
Qy 1681 

AC AGT CAT CGT CCCCTT T AGGAC AGT AGAAGGGACAGCCAAGGGT GGC GGT GAGGACTTT 1740 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1743 

ACAGTCAT CGT CCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 1802 
Qy 1741 

GAAGACAC AT AT GGGGAGT T GGAATTCAAGAAT GATGAAACT GT GAAAAC CATAAGGGTT 1800 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II 

I 

Db 1803 

GAAGACACAT AT GGGGAGTT GGAAT T CAAGAAT GAT GAAACTGT CAAAACAATT CACATC 1862 
Qy 1801 

AAAAT AGT AGAT GAGGAGGAAT ACGAAAGGCAAGAGAATT T CT T C ATT GC C CT T GGT GAA 1860 

II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 1863 

AAGGTAATT GAT GATGAGGCATAT GAGAAAAACAAGAATTACTTCATTGAGATGATGGGC 1922 

Qy 1861 CCGAAATGGATGGAACGT GGAAT ATCAGAT GTG 

ACAGACAGGAAGCTG 1908 

II I I I I I II III 

I I I I I II I I I I I I I 
Db 1923 

C C CC GCAT GGT GGAT AT GAGT TTT CAGAAAGCGCT CCT GTT AT CT C CAGAC AGGAAGCT G 1982 
Qy 1909 

ACTAT GGAAGAAGAGGAGGC CAAGAGGAT AGCAGAGAT GGGAAAGCC AGT ATT GGGT GAA 1968 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1983 

ACTAT GGAAGAAGAGGAGGC CAAGAGGAT AGCAGAGAT GGGAAAGCC AGT ATT GGGT GAA 2042 
Qy 1969 

CACCCCAAACT AGAAGT CAT CAT T GAAGAGT CCT AT GAGTT CAAGACT AC GGT GGACAAA 2028 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2043 

CACCCCAAACT AGAAGT CAT CAT T GAAGAGT C CT AT GAGT T CAAGACT AC GGT GGACAAA 2102 
Qy 2029 

CT GAT CAAGAAGACAAAC CT GGC CTT GGTT GT GGGGAC CCATT CCT GGAGGGACCAGTT C 2088 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 2103 

CT GATCAAGAAGACAAACCTGGC CTT GGTT GT GGGGACCCATT C CT GGAGGGACCAGTT C 2162 
Qy 2089 

ATGGAGGCCATCACCGTCAGTGCAGCAGGGGATGAGGATGAGGATGAATCCGGGGAGGAG 2148 

I I I I I I I I I I I II I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2163 

AT GGAGGCCATCACCGTCAGTGCAGCAGGGGAT GAGGATGAGGATGAAT CCGGGGAGGAG 2222 
Qy 2149 

AGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTG 2208 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2223 

AGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTG 2282 
Qy 2209 

TTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATC 2268 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2283 

TTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATC 2342 
Qy 2269 

CT CAT CATT GGCAT GCT CACC GC CAT CAT T GGGGAC CT GGC CT CGCACT T CGGCT GCAC C 2328 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 2343 

CTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACC 2402 
Qy 2329 

ATT GGT CT CAAAGAT T CAGT CACAGCT GTT GTT T T C GT GGCATT TGGCACCT CTGT CC C A 2388 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2403 

ATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCA 24 62 
Qy 2389 

GAT ACGTTTGCCAGCAAAGCT GCT GC C CT C CAGGAT GT AT ATGC AGACGC CT C CAT T GGC 2448 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2463 

GATACGT TT GCCAGCAAAGCT GCT GC CCT CCAGGAT GT AT AT GCAGACGCCT C C ATTGGC 2522 
Qy 2449 

AACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTG 2508 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2523 

AACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTG 2582 
Qy 2509 

GCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCC 2568 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2583 

GCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCC 2642 



Qy 2569 

TTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGA 2628 



I I I I I I I I I I II I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2643 

TTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGA 2702 
Qy 2629 

AGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACA 2688 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2703 

AGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACA 2762 
Qy 2689 

TGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGC 2748 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2763 

TGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGC 2822 

Qy 2749 TACATCAAGGGGTTCTAA 27 66 

I I I I I I I I I I I I I I I I I I 
Db 2823 TACATCAAGGGGTTCTAA 2840 
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Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 1 I I I I I II I I I I I 
Db 9 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 68 

Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 69 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 128 
Qy 121 

AC AGGGCAGAACAAT GAGT CCT GT T CAGGGT CAT CGGACT GCAAGGAGGGT GT C AT CCT G 180 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 129 

ACAGGGCAGAACAATGAGT C CT GTT CAGGGT CAT CGGACT GCAAGGAGGGT GT CAT CCTG 188 
Qy 181 

C CAATCT GGTACCCGGAGAACC CTT CC CTT GGGGACAAGATTGC CAGGGT CATT GT CT AT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 189 

C CAAT CT GGTACC CGGAGAACC CTT CCCTT GGGGACAAGATT GC CAGGGT CATT GT CTAT 248 
Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 249 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 308 
Qy 301 

TCTATTGAAGT CATCACCT CTCAAGAGAGGGAGGTGACAATTAAGAAACCCAAT GGAGAA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 309 

T CT ATT GAAGT CAT CAC CT CT CAAGAGAGGGAGGT GACAATTAAGAAACCCAAT GGAGAA 368 
Qy 361 

AC CAGCACAACC ACT ATT CGGGT CT GGAAT GAAACT GT CT CCAACCT GAC C CTT AT GGCC 420 

I I II I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I i I I I I I I I I I I III I 
Db 369 

ACCAGCACAAC CACT AT T CGGGT CT GGAAT GAAACT GT CT C CAACCT GACCCTT AT GGCC 428 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 429 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 488 
Qy 481 

ATT GCT GGT GAT CT GGGAC CTT CT ACC ATT GTAGGGAGT GCAGC CTT CAACAT GTT CAT C 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 489 

ATT GCT GGT GAT CT GGGAC CTT CTACCATT GTAGGGAGT GCAGCCTT CAACATGTT CAT C 548 
Qy 541 

ATCATTGGCATCT GTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGATCAAGCAT CTA 600 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 549 

AT CATTGGCAT CTGTGTCT ACGT GATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 608 
Qy 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 609 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 668 
Qy 661 



CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I i I I I I I 
Db 669 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 728 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 729 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 788 
Qy 781 

AT GC ACAAAAAGT AC C GCAC AGACAAACACC GAGGAAT TAT CAT AGAGAC AGAGGGTGAC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 789 

AT GC ACAAAAAGT ACC GCACAGACAAACACCGAGGAATTAT CAT AGAGAC AGAGGGTGAC 848 
Qy 841 

CACC CTAAGGGCATT GAGAT GGAT GGGAAAAT GAT GAATT CC CATT T TCT AGATGGGAAC 900 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 849 

CACCCTAAGGGCATTGAGATGGATGGGAAAATGATGAATTCCCATTTTCTAGATGGGAAC 908 
Qy 901 

CTGGTGCCC CT GGAAGGGAAGGAAGT GGAT GAGT CCC GCAGAGAGAT GAT CCGGAT TCT C 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 909 

CT GGT GC CCCT GGAAGGGAAGGAAGT GGAT GAGT CCCGCAGAGAGAT GAT CC GGATTCT C 968 
Qy 961 

AAGGATCTGAAGCAAAAACACCCAGAGAAGGACTTAGATCAGCT GGT GGAGATGGCCAAT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 969 

AAGGAT CT GAAGCAAAAACAC C C AGAGAAGGACTT AGATCAGCT GGT GGAGAT GGCCAAT 1028 
Qy 1021 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1029 

TACT AT GCT CTT T CC C ACCAAC AGAAGAGCCGT GC CTT CTACC GTAT CCAAGC CACT C GT 1088 
Qy 1081 

AT GAT GACT GGT GCAGGCAAT AT C CT GAAGAAACAT GCAGCAGAACAAGC CAAGAAGGCC 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1089 

AT GAT GACT GGT GCAGGCAAT ATC CTGAAGAAAC AT GCAGCAGAACAAGC CAAGAAGGCC 1148 
Qy 1141 

T CCAGCAT GAGCGAGGTGC ACACC GAT GAGCCT GAGGACT TT ATTT CCAAGGT CT T CT TT 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1149 



T CC AGCAT GAGCGAGGT GC ACAC C GATGAGCCT GAGGACTT T ATTT CCAAGGT CTT CTT T 1208 
Qy 1201 

GAC CCAT GT TCTT ACCAGT GCCT GGAGAACTGT GGGGCT GT ACT CCT GAC AGT GGT GAGG 1260 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1209 

GAC CCAT GT TCTT AC C AGT GCCT GGAGAACT GT GGGGCT GT ACT C CT GACAGTGGT GAGG 1268 
Qy 1261 

AAAGGGGGAGACAT GT CAAAGAC CAT GT AT GT GGACTACAAAACAGAGGAT GGTT CT GCC 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1269 

AAAGGGGGAGACAT GT CAAAGAC CAT GT AT GT GGACTACAAAACAGAGGAT GGT T CT GC C 1328 
Qy 1321 

AATGCAGGGGCTGACTAT GAGTT CACAGAGGGCACGGTGGTTCTGAAGCCAGGAGAGACC 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1329 

AATGCAGGGGCTGACTAT GAGTT CACAGAGGGCACGGT GGTTCTGAAGCCAGGAGAGACC 1388 
Qy 1381 

CAGAAGGAGTTCT CCGTGGGCATAATT GAT GACGACATTTTTGAGGAGGAT GAACACTT C 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1389 

CAGAAGGAGTT CTCCGTGGGCATAATTGAT GACGACATTTTTGAGGAGGAT GAACACTT C 1448 
Qy 1441 

TT T GTAAGGTT GAGCAAT GT CC GC ATAGAGGAGGAGCAGCCAGAGGAGGGGAT GC CT CC A 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1449 

TTT GTAAGGTT GAGCAAT GT C C GCAT AGAGGAGGAGCAGC CAGAGGAGGGGAT GCCT CCA 1508 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 1509 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1568 
Qy 1561 

GT T AC CAT CTT GGAT GAT GAC CAT GCAGGC AT CTT CACT TTTGAAT GT GAT ACT ATT CAT 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1569 

GTT ACCAT CTT GGAT GAT GAC CAT GCAGGCAT CTT CACT TT T GAAT GTGAT ACT AT T CAT 1628 
Qy 1621 

GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1629 

GT CAGT GAGAGT ATT GGT GT T AT GGAGGT CAAGGTT CT GCGGACAT C AGGT GCC C GGGGT 1688 
Qy 1681 

ACAGT CAT CGTCCCCTT T AGGACAGT AGAAGGGACAGCCAAGGGT GGCGGT GAGGACTTT 1740 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1689 

ACAGTCATCGT C CCCTT T AGGACAGT AGAAGGGACAGCCAAGGGT GGCGGT GAGGACTT T 1748 
Qy 1741 

GAAGACACATAT GGGGAGTT GGAATTCAAGAATGATGAAACTGTGAAAACCATAAGGGTT 1800 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

I I I 

Db 1749 

GAAGACACATAT GGGGAGTT GGAATT CAAGAAT GAT GAAACT GT GAAAACT CT TCAGGT G 1808 



Qy 1801 

AAAAT AGT AGAT GAGGAGGAAT AC GAAAGGCAAGAGAATTTCT T CATT GCC CTT GGT GAA 1860 



Db 1809 

AAGAT AGTT GAT GACGAGGAAT AT GAGAAAAAGGATAAT TTCT T CATT GAGCT GGGCCAG 1868 

Qy 1861 C CGAAAT G GAT GGAACGT GGAAT AT C AGAT GT GACA 

GACAGG 1902 

II I I I I I I II II M MM I II I 

I I I I I I 

Db 1869 

C C C C AGT GGCTTAAGCGAGGGATT T C AGCTCT GCT ACT CAAT CAAGGGGAT GGGGACAGG 1928 



Qy 1903 

AAGCTGACTATGGAAGAAGAGGAGGCCAAGAGGATAGCAGAGATGGGAAAGCCAGTATTG 1962 

I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

I 

Db 1929 

AAGCTAACAGC C GAGGAGGAGGAGGCT C GGAGGAT AGC AGAGATGGGCAAGCCAGT T CTT 1988 



Qy 1963 

GGTGAACACCCCAAACTAGAAGTCAT CATTGAAGAGT CCTATGAGTT CAAGACTACGGTG 2022 

MM II I I I I I I I I I I I I I I I I I I I I M I II I I I I I I 

I I I I I I 

Db 1989 

GGGGAGAACTGCCGGCT GGAGGTCAT CAT CGAGGAGT CATATGATTTTAAGAACACGGTG 204 8 
Qy 2023 

GACAAACT GAT CAAGAAGACAAAC CT GGC CTT GGT T GTGGGGACC C AT TC CTGGAGGGAC 2082 

II I I I I I M I I I I I I M Ml I I I I I I I M I I I I I I M I I I I I 

I II I I I I I 

Db 2049 

GATAAACTCAT CAAGAAAACGAACTTGGCCTT GGTAATTGGGACCCATTCATGGAGGGAG 2108 

Qy 2083 CAGT T CAT GGAGGC CAT C AC C GT C AGTGC AGCAGGGGAT GAGGAT GAGGAT GAA — 

TCC 2139 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2109 

C AGTTTTTAGAGGCAAT T AC GGT GAGCGCAGGGGAC GAGGAGGAGGAGGAGGAC GGGT CC 2168 



Qy 2140 

GGGGAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGG 2199 

I I II I I I I I I I I II I II I II I I I II I I I I I I I I I I I I I I I I I I I I II 

II I I I I 

Db 2169 

CGGGAGGAGCGGCTGCCGTCGTGCTTTGACTACGTGATGCACTTCCTGACGGTGTTCTGG 2228 



Qy 2200 

AAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCC 2259 

I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2229 

AAGGTGCTCTTCGCCTGTGTGCCCCCCACCGAGTACTGCCACGGCTGGGCCTGCTTTGGT 2288 



Qy 2260 

GTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTC 2319 

I I I I I I I I I I I I I I I III I I I I I I I I I I I I II I I I I I I I I I I 1 I I I I 

I I I I I I 

Db 2289 

GTCTCCATCCTGGTCATCGGCCTGCTCACCGCCCTCATTGGGGACCTCGCCTCCCACTTC 2348 



Qy 2320 

GGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACC 2379 

I I I I I I I I I I I I I I I I I I II II I I II I I I I I I I I Mill II I 

I I I I I I 

Db 2349 

GGCTGCACCGTTGGCCTCAAGGACTCTGTCAATGCTGTTGTCTTCGTTGCCCTGGGCACC 2408 



Qy 2380 

T CT GT C CCAGAT ACGT TT GCCAGCAAAGCT GCT GCCCT CC AGGAT GT AT AT GCAGAC GC C 2439 

II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II 

I I I I I 

Db 2409 

T CCATC C CTGACACGTT CGCCAGCAAGGT GGCGGCGCT GCAGGACCAGT GC GCCGAC GCG 2468 



Qy 2440 

TCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCC 2499 

I I I I I I I I I I I I I I I I III I I II I I II II II I I I I I II I I III I 

I I I 

Db 2469 

TCCATCGGCAACGTGACCGGCTCCAACGCGGTGAACGTGTTCCTTGGCCTGGGCGTCGCC 2528 



Qy 2500 

TGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGC 2559 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2529 

TGGTCTGTGGCCGCCGTGTACTGGGCGGTGCAGGGCCGCCCCTTCGAGGTGCGCACTGGC 2588 



Qy 2560 

ACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTC 2619 
II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II II II I I I I 

I I I I I I 

Db 2589 

ACGCTGGCCTTCTCCGTCACGCTCTTCACCGTCTTCGCCTTCGTGGGCATTGCCGTGCTG 2648 



Qy 2620 

TTGTACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTC 2679 
I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

I I I I I I I 

Db 2649 

CTGTACCGGCGCCGGCCGCACATCGGCGGCGAGCTGGGCGGCCCGCGCGGACCCAAGCTC 2708 



Qy 2680 

GCCACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAG 2739 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I I 



Db 2709 

GCCACCACCGCGCTCTTCCTGGGCCTCTGGCTCCTGTACATCCTCTTCGCCAGCCTGGAG 2768 



Qy 2740 GCCTATTGCTACATCAAGGGGTTCTA 2765 

I I I I I I I I I I I I I I I I I I I I 
Db 2769 GCGTACTGCCACATCCGGGGCTTCTA 2794 



RESULT 8 
US-10-256-537-1 

; Sequence 1, Application US/10256537 
; Publication No. US20030162196A1 
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; PRIOR FILING DATE: 2001-09-28 
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SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 1 

LENGTH: 2534 
; TYPE: DNA 
; ORGANISM: Homo sapien 
US-10-256-537-1 

Query Match 64.6%; Score 1786.4; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1787; Conservative 0; Mismatches 
Gaps 0; 

Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 343 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 402 

Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I I I I II I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 403 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 4 62 

Qy 121 

AC AGGGCAGAACAAT GAGT CCT GT T C AGGGT CAT CGGACT GCAAGGAGGGT GT C AT CCT G 180 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 463 

ACAGGGC AGAACAAT GAGT CCT GTT CAGG GT CAT C GGACT GCAAGGAGGGTGTCAT CCT G 522 
Qy 181 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 523 



DB 15; Length 2534; 
1; Indels 0; 



CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 582 
Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 583 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 642 
Qy 301 

T CT ATT GAAGTCATC ACCT CT CAAGAGAGGGAGGT GACAATTAAGAAAC C CAATGGAGAA 360 

I I I I I I I I I I I I II I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 643 

T CT ATT GAAGTCATCACCT CT CAAGAGAGGGAGGT GACAATTAAGAAACC CAAT GGAGAA 702 
Qy 361 

ACC AGCACAACCACT ATT CGGGT CT GGAAT GAAACT GT CT C CAACCT GAC CCTT AT GGCC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 703 

AC CAGCACAACC ACT ATT CGGGT CT GGAAT GAAACT GT CT CCAACCT GAC CCTT AT GGCC 762 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 763 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 822 
Qy 481 

AT T GCT GGT GAT CT GGGACCT T CT AC CATT GT AGGGAGTGCAGCCTT CAACAT GTT CAT C 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 823 

ATT GCT GGT GAT CT GGGAC CTT CT ACC ATT GT AGGGAGT G CAGCCT T CAACAT GTT CAT C 882 
Qy 541 

AT CAT T GGCATCT GT GTCT ACGTGAT CC CAGAC GGAGAGACT C GCAAGAT CAAGCATCT A 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 883 

AT CATT GGC AT CT GT GT CT AC GT GAT C CCAGAC GGAGAGACT CGCAAGAT CAAGC AT CTA 942 
Qy 601 

CGAGT CT T CTT C ATCACC GCT GCTT GGAGT AT CTTT GC CT ACAT CT GGCT CT AT AT GATT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 943 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 1002 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1003 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 1062 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 

Db 1063 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 1122 
Qy 781 

AT GCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGT GAC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1123 

AT GCACAAAAAGT AC CGCACAGACAAACACCGAGGAATT AT C AT AGAGACAGAGGGT GAC 1182 
Qy 841 

C AC CCTAAGGGCAT T GAGATGGATGGGAAAATGAT GAATTCC CAT T TT CT AGAT GGGAAC 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1183 

CACCCTAAGGGCATT GAGATGGAT GGGAAAATGAT GAATT C CCAT TTT CT AGAT GGGAAC 1242 
Qy 901 

CT GGT GC CCCT GGAAGGGAAGGAAGT GGAT GAGT CCCGCAGAGAGAT GAT CCGGATT CT C 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1243 

CTGGTGCCCCT GGAAGGGAAGGAAGT GGAT GAGT C CCGCAGAGAGAT GAT CC GGATT CT C 1302 
Qy 961 

AAGGAT CT GAAGCAAAAACAC CCAGAGAAGGACT TAGAT CAGCT GGT GGAGAT GGCCAAT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1303 

AAGGAT CT GAAGCAAAAACACC CAGAGAAGGACTT AGAT CAGCTGGT GGAGAT GGC CAAT 1362 
Qy 1021 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 



Db 1363 

TACT AT GCT CTTT CCCAC CAACAGAAGAGCCGC GC CTT CT AC C GT AT C CAAGC CACT C GT 1422 
Qy 1081 

ATGAT GACT GGTGCAGGCAATATCCT GAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1423 

AT GAT GACT GGT GCAGGCAAT AT CCT GAAGAAACATGCAGCAGAACAAGC CAAGAAGGCC 1482 
Qy 1141 

T CCAGCAT GAGC GAGGT GC ACACCGAT GAGCCT GAGGACTTT ATTT C CAAGGT CTT CT TT 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1483 

T C C AGC AT GAGCGAGGT GCACACC GAT GAGCCT GAGGACTT T AT T T CCAAGGT CTT CTT T 1542 
Qy 1201 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1543 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1602 



Qy 1261 

AAAGGGGGAGAC AT GT CAAAGACCAT GT ATGT GGACT ACAAAAC AGAGGAT GGTT CT GCC 1320 

I I I I I I I I ! I I I I I I I I I I I II I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1603 

AAAGGGGGAGACAT GT CAAAGACCAT GT AT GT GGACT ACAAAAC AGAGGAT GGTT CT GCC 1662 
Qy 1321 

AAT GCAGGGGCT GACT ATGAGTT CAC AGAGGGC ACGGT GGTT CT GAAGC CAGGAGAGACC 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1663 

AAT GCAGGGGCT GACT AT GAGTT CACAGAGGGC AC GGT GGTT CT GAAGCCAGGAGAGACC 1722 
Qy 1381 

CAGAAGGAGTTCT CCGTGGGCATAATTGATGACGACATTTTT GAGGAGGATGAACACTTC 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1723 

CAGAAGGAGTT CT CCGT GGGCATAAT TGATGACGACAT T TTT GAGGAGGAT GAAC ACTTC 1782 
Qy 1441 

TTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 1500 

I I I I I I I I I I I I II I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1783 

TTT GTAAGGTT GAGCAATGT CCGC AT AGAGGAGGAGC AGC C AGAGGAGGGGAT GCCT C CA 1842 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1843 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1902 
Qy 1561 

GTT ACCAT CT T GGATGAT GACCAT GC AGGCAT CT T C ACTT T T GAAT GT GAT ACT ATT CAT 1620 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1903 

GTT AC CAT CTT GGAT GAT GACCATGCAGGCAT CTT CACTTTT GAAT GT GAT ACT ATT CAT 1962 
Qy 1621 

GT CAGTGAGAGT ATTGGTGTTAT GGAGGT CAAGGTT CT GCGGAC AT CAGGT GCCCGGGGT 1680 

I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I 

Db 1963 

GT CAGT GAGAGT ATT GGT GTTAT GGAGGT CAAGGT T CT GC GGAC AT CAGGT GCCC GGGGT 2022 
Qy 1681 

ACAGT CATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGT GGCGGT GAGGACTTT 1740 

II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 2023 

ACAGT CAT CGT CC C CTT T AGGAC AGT AGAAGGGAC AGC CAAGGGT GGCGGT GAGGACT TT 2082 

Qy 1741 GAAGACAC AT AT GGGGAGTT GGAATT CAAGAAT GATGAAACTGT GAAA 1788 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 2083 GAAGACACAT AT GGGGAGTT GGAATT CAAGAAT GATGAAACTGT GTAA 2130 



RESULT 9 
US-10-256-537-3 

; Sequence 3, Application US/10256537 
; Publication No. US20030162196A1 
; GENERAL INFORMATION: 
; APPLICANT: Carroll, Joseph M. 

; TITLE OF INVENTION: METHODS OF USING 69039, A NOVEL HUMAN 

; TITLE OF INVENTION: NA/CA EXCHANGER FAMILY MEMBER 

; FILE REFERENCE: MPI01-231P1RM 

; CURRENT APPLICATION NUMBER: US/ 10/256, 537 

; CURRENT FILING DATE: 2002-04-19 

; PRIOR APPLICATION NUMBER: 60/325,737 

; PRIOR FILING DATE: 2001-09-28 

; NUMBER OF SEQ ID NOS : 5 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 3 

LENGTH: 2534 
; TYPE: DNA 
; ORGANISM: Homo sapien 

FEATURE: 

NAME/ KEY: CDS 
; LOCATION: (343) ... (2130) 
US-10-256-537-3 



Query Match 64.6%; 
Best Local Similarity 99.9%; 
Matches 1787; Conservative 
Gaps 0; 



Score 1786.4; DB 15; Length 2534 
Pred. No. 0; 
0; Mismatches 1; Indels 0; 



Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 343 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 402 
Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 12 0 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 403 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 4 62 
Qy 121 

ACAGGGCAGAACAATGAGTCCTGTT CAGGGTCAT CGGACT GCAAGGAGGGTGTCATCCTG 180 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 463 

ACAGGGC AGAACAAT GAGT CCT GTT C AGGGT CAT C GGACT GCAAGGAGGGT GT CAT C CT G 522 
Qy 181 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 523 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 582 
Qy 241 



TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I I I I I I I II I I I II I I I III II I I I I I I I II I I I I I I I III I I II I II I I I M I I! I II I 
Db 583 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 642 
Qy 301 

TCTATTGAAGTCAT CACCTCTCAAGAGAGGGAGGTGACAATTAAGAAACCCAAT GGAGAA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 643 

T CT ATT GAAGTCAT CAC CT CT CAAGAGAGGGAGGT GACAATTAAGAAAC CCAAT GGAGAA 7 02 
Qy 361 

ACCAGCACAACCACTATT CGGGTCT GGAATGAAACT GT CTCCAACCTGACCCTTAT GGCC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 
Db 703 

AC CAGCACAACC ACT ATT CGGGT CT GGAAT GAAACT GT CT CCAAC CT GACCCTT AT GGC C 7 62 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 763 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 822 
Qy 481 

ATT GCT GGT GAT CT GGGACCTT CT AC CATTGT AGGGAGT GCAGCCT T CAACAT GTT CAT C 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 823 

ATTGCTGGTGATCTGGGACCTTCTACCATTGTAGGGAGTGCAGCCTTCAACATGTTCATC 882 
Qy 541 

ATCATTGGCATCTGTGTCTACGT GATCCCAGACGGAGAGACT CGCAAGAT CAAGCATCTA 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I 
Db 883 

AT CATT GGC AT CT GT GT CTACGT GAT CC C AGACGGAGAGACT C GCAAGAT CAAGCAT CT A 942 
Qy 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 943 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 1002 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I II I II I I I I I 
Db 1003 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 1062 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1063 



TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 1122 
Qy 781 

ATGC ACAAAAAGT AC C GC AC AGACAAAC ACCGAGGAAT TAT CATAGAGACAGAGGGT GAC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1123 

AT GC ACAAAAAGT AC C GC AC AGACAAAC ACCGAGGAAT TAT CATAGAGACAGAGGGT GAC 1182 
Qy 841 

CACC CTAAGGGCATT GAGAT GGAT GGGAAAAT GAT GAATT C CCATTTTCT AGAT GGGAAC 900 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1183 

CACC CTAAGGGCATT GAGAT GGAT GGGAAAAT GAT GAATT CCCATTTTCT AGAT GGGAAC 1242 
Qy 901 

CT GGTGC CC CTGGAAGGGAAGGAAGT GGAT GAGT CCCGCAGAGAGAT GAT CCGGAT T CT C 960 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1243 

CT GGT GCCC CT GGAAGGGAAGGAAGT GGAT GAGT C CC GCAGAGAGAT GAT CCGGATT CT C 1302 
Qy 961 

AAGGAT CT GAAGCAAAAACACCCAGAGAAGGACTT AGAT C AGCT GGT GGAGAT GGC CAAT 1020 



Db 1303 

AAGGAT CTGAAGCAAAAACAC C C AGAGAAGGACTT AGAT CAGCT GGT GGAGAT GGC CAAT 1362 
Qy 1021 

T ACT AT GCT CTTT CC C AC CAAC AGAAGAGC CGC GC CTT CT ACC GTAT CCAAGC CACT CGT 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1363 

TACT AT GCT CTTT CC C AC CAAC AGAAGAGC C GC GC CTT CT ACC GTAT CCAAGC CACT C GT 1422 
Qy 1081 

ATGATGACTGGTGCAGGCAATAT CCT GAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 1140 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1423 

ATGATGACTGGTGCAGGCAATATCCTGAAGAAACATGCAGCAG7U\CAAGCCAAGAAGGCC 14 82 
Qy 1141 

T CCAGCATGAGCGAGGTGCAC ACCGAT GAGC CTGAGGACTT TATTT C CAAGGT CTT CT T T 1200 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1483 

T CCAGC AT GAGC GAGGTGCACAC C GAT GAGC CT GAGGACT T TATTT CCAAGGT CTT CT TT 1542 
Qy 1201 

GAC CCAT GTT CTT ACC AGT GC CTGGAGAACT GT GGGGCT GT ACT CCT GACAGT GGT GAGG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1543 

GACC C AT GT T CTT ACCAGT GCCTGGAGAACT GT GGGGCT GTACT CCT GACAGT GGT GAGG 1602 
Qy 1261 

AAAGGGGGAGACAT GT CAAAGACCAT GTAT GT GGACTACAAAACAGAGGATGGTT CTGCC 1320 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1603 

AAAGGGGGAGACAT GT CAAAGAC CAT GT AT GT GGACTACAAAACAGAGGAT GGTT CT GC C 1662 
Qy 1321 

AAT GC AGGGGCT GACTAT GAGT T CACAGAGGGCAC GGT GGT T CT GAAGCC AGGAGAGAC C 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 11 
Db 1663 

AAT GC AGGGGCT GACTAT GAGTT CACAGAGGGCAC GGT GGTT CT GAAGCCAGGAGAGACC 1722 
Qy 1381 

CAGAAGGAGT T CT C CGT GGGCATAATT GAT GACGACATTTT T GAGGAGGAT GAACACTT C 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1723 

CAGAAGGAGT T CT C CGT GGGCATAATT GAT GACGACATTT TT GAGGAGGAT GAACACTT C 1782 
Qy 1441 

TTT GTAAGGT T GAGCAAT GT C C GCATAGAGGAGGAGCAGC CAGAGGAGGGGAT GC CT CCA 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I 
Db 1783 

TTT GTAAGGTT GAGCAAT GT C CGC ATAGAGGAGGAGCAGC CAGAGGAGGGGATGC CT CCA 1842 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1843 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1902 
Qy 1561 

GTTACCAT CTTGGATGATGACCAT GCAGGCATCTTCACTTTTGAATGTGATACTATT CAT 1620 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1903 

GTTAC CAT CT T GGAT GAT GACCAT GC AGGCAT CTT CACTT T T GAAT GT GAT ACT ATT CAT 1962 
Qy 1621 

GT CAGT GAGAGT ATT GGT GTT AT GGAGGTCAAGGT T CT GC GGACAT C AGGT GCC C GGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1963 

GT CAGTGAGAGT AT T GGT GT T AT GGAGGTCAAGGTTCT GCGGAC AT CAGGT GC CCGGGGT 2022 
Qy 1681 

ACAGTCAT CGT CCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGTGGCGGTGAGGACTTT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 2023 

ACAGT CATCGTCCCCTTTAGGACAGTAGAAGGGACAGCCAAGGGT GGCGGTGAGGACTTT 2082 

Qy 1741 GAAGAC ACAT AT GGGGAGT T GGAAT T CAAGAAT GAT GAAACT GT GAAA 1788 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 2083 GAAGACACATATGGGGAGTT GGAATT CAAGAAT GAT GAAACT GTGTAA 2130 



RESULT 10 



US-09-804-474A-3 

; Sequence 3, Application US/09804474A 

; Patent No. US20020119518A1 

; GENERAL INFORMATION: 

; APPLICANT: KODET, Stefan et al 

; TITLE OF INVENTION: ISOLATED HUMAN TRANSPORTER PROTEINS, 
; TITLE OF INVENTION: NUCLEIC ACID MOLECULES ENCODING HUMAN 
TRANSPORTER PROTEINS, 

; TITLE OF INVENTION: AND USES THEREOF 
; FILE REFERENCE: CL000891 

; CURRENT APPLICATION NUMBER: US/09/804, 474A 
; CURRENT FILING DATE: 2001-03-13 
; NUMBER OF SEQ ID NOS : 4 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 3 

LENGTH: 126512 

TYPE: DNA 

ORGANISM: Human 

FEATURE: 

NAME/ KEY: misc_f eature 
; LOCATION: ( 1) ...( 126512 ) 
; OTHER INFORMATION: n = A,T,C or G 
US-09-804-474A-3 

Query Match 64.5%; Score 1784.8; DB 9; Length 126512; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1786; Conservative 0; Mismatches 2; Indels 0; 
Gaps 0; 

Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2010 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 2069 
Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2070 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 2129 
Qy 121 

ACAGGGCAGAACAAT GAGT C CT GTT CAGGGT C AT CGGACT GCAAGGAGGGT GT CATC CT G 180 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2130 

ACAGGGCAGAACAAT GAGTCCT GT T CAGGGT CAT CGGACT GCAAGGAGGGT GT CAT C CT G 2189 

Qy 181 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2190 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 2249 



Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 2250 

T TT GTGGCC CT GATAT ACATGT T C CTT GGGGT GT CCAT CATTGCT GACC GCT T CAT GGCA 2309 
Qy 301 

T CT ATT GAAGT CATCACCT CT CAAGAGAGGGAGGTGACAATTAAGAAAC C CAAT GGAGAA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I 
Db 2310 

T CT ATTGAAGTCATCACCT CT CAAGAGAGGGAGGTGACAATTAAGAAACCCAATGGAGAA 2369 
Qy 361 

ACCAGCACAACCACTATT CGGGT CT GGAAT GAAACT GTCTCCAACCTGACCCTTATGGCC 420 
I I I I I I I II I I 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2370 

AC C AGCACAACAACT ATT C GGGT CT GGAAT GAAACT GT CT CCAACCT GAC CCTTAT GGC C 2429 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2430 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 2489 
Qy 481 

ATT GCT GGT GAT CT GGGAC CTT CT ACCATT GT AGGGAGT GC AGCCTT CAACAT GT T CAT C 540 

I I I I I I II I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2490 

ATT GCT GGT GAT CTGGGACCT T CT ACCAT T GT AGGGAGT GCAGC CT T CAACAT GTT C AT C 2549 
Qy 541 

AT CATT GGC ATCT GT GT CT AC GT GAT CC C AGACGGAGAGACT C GCAAGAT CAAGCATCTA 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 2550 

AT CAT T GGCAT CT GT GTCT ACGT GAT C CCAGAC GGAGAGACT CGCAAGAT CAAGC AT CT A 2609 
Qy 601 

CGAGT CTTCT T CATC ACCGCT GCT T GGAGT AT CT TT GC CT ACAT CT GGCT CT ATAT GATT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2610 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 2669 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I I I I I I I I I I I I I II I I I I I I I I I II I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2670 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 2729 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2730 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 2789 



Qy 781 

ATGC ACAAAAAGT AC C GCACAGACAAAC AC CGAGGAATT AT C AT AGAGACAGAGGGT GAC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I II I I I I I I I I I I I I I I I I I I I I 
Db 2790 

ATGCACAAAAAGT AC C GCACAGACAAAC AC CGAGGAATT AT CAT AGAGACAGAGGGT GAC 2849 
Qy 841 

CAC C CT AAGGGCATT GAGAT GGAT GGGAAAAT GATGAATTC C CAT TTT CT AGAT GGGAAC 900 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 2850 

CAC C CTAAGGGCATT GAGAT GGAT GGGAAAAT GAT GAATTC C CATTTT CT AGAT GGGAAC 2909 
Qy 901 

CTGGT GCCCCTGGAAGGGAAGGAAGTGGAT GAGT CCCGCAGAGAGATGATCCGGATT CTC 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I 
Db 2910 

CTGGT GCC C CT GGAAGGGAAGGAAGT GGAT GAGT CCC GCAGAGAGAT GAT CC GGATT CT C 2969 
Qy 961 

AAGGATCT GAAGCAAAAAC ACCCAGAGAAGGACT TAGAT CAGCT GGT GGAGAT GGC CAAT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2970 

AAGGAT CT GAAGCAAAAAC ACC CAGAGAAGGACTT AGAT CAGCT GGT GGAGAT GGC CAAT 3029 
Qy 1021 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3030 

T ACT ATGCT CTT T C C CAC CAAC AGAAGAGCCGCGC CT T CT ACC GT AT CCAAGC C ACT CGT 3089 
Qy 1081 

ATGAT GACTGGT GCAGGCAATATCCT GAAGAAACATGCAGCAGAACAAGCCAAGAAGGCC 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3090 

AT GAT GACT GGT GCAGGCAATAT CCT GAAGAAACATGCAGCAGAACAAGC CAAGAAGGCC 314 9 
Qy 1141 

T CCAGCAT GAGCGAGGT GCACAC C GAT GAGCCT GAGGACTT T AT TT CCAAGGT CTT CTTT 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I 
Db 3150 

T CCAGCAT GAGCGAGGTGC ACAC C GAT GAGCCT GAGGACTT TATTT C CAAGGT CTT CT TT 3209 
Qy 1201 

GAC C CAT GT T CT T AC C AGT GC CT GGAGAACT GT GGGGCT GT ACTCCTGAC AGT GGT GAGG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3210 

GACC CATGTT CTTAC CAGT GCCT GGAGAACT GT GGGGCT GTACTCCTGACAGT GGT GAGG 32 69 
Qy 1261 

AAAGGGGGAGACAT GT CAAAGACCAT GT AT GT GGACT ACAAAAC AGAGGATGGTTCT GC C 1320 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 



Db 3270 

AAAGGGGGAGACAT GT CAAAGACCAT GT AT GT GGACT ACAAAACAGAGGAT GGTTCT GC C 3329 
Qy 1321 

AAT GCAGGGGCT GACT AT GAGTT CACAGAGGGCAC GGTGGTT CT GAAGC CAGGAGAGAC C 1380 



Db 3330 

AAT GCAGGGGCT GACT AT GAGT T CACAGAGGGCACGGT GGTT CT GAAGC CAGGAGAGAC C 3389 
Qy 1381 

CAGAAGGAGTT CT CCGT GGGCATAATTGATGACGACATTTTT GAGGAGGAT GAACACTT C 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I 
Db 3390 

CAGAAGGAGTT CTCCGTGGGCATAATTGATGACGACATTTTT GAGGAGGAT GAACACTT C 3449 
Qy 1441 

T TT GTAAGGTT GAGCAAT GT CC GCAT AGAGGAGGAGC AGC CAGAGGAGGGGAT GCCT CC A 1500 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I 
Db 3450 

T TT GTAAGGTT GAGCAAT GT C C GCAT AGAGGAGGAGC AGC CAGAGGAGGGGAT GCCT CCA 3509 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3510 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 3569 
Qy 1561 

GTT ACCAT CTT GGAT GAT GAC CAT GC AGGCAT CT T C ACTTTT GAAT GT GAT ACT AT T CAT 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3570 

GTTACCATCTT GGAT GAT GAC CAT GCAGGCAT CTTCACTTTTGAATGT GATACTATTCAT 3629 
Qy 1621 

GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3630 

GTCAGTGAGAGTATT GGT GTTATGGAGGTCAAGGTTCTGCGGACAT CAGGTGCCCGGGGT 3689 
Qy 1681 

AC AGTC AT CGT C C C CTTT AGGACAGT AGAAGGGACAGCCAAGGGT GGCGGT GAGGACTTT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3690 

ACAGT CAT C GT C CCCTTT AGGACAGT AGAAGGGACAGCCAAGGGT GGCGGT GAGGACTT T 3749 

Qy 1741 GAAGACACAT AT GGGGAGTT GGAAT T CAAGAAT GAT GAAACT GT GAAA 1788 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 3750 GAAGACACAT AT GGGGAGTT GGAATT CAAGAAT GAT GAAACT GT GTAA 3797 



RESULT 11 
US-10-054-680-3 

; Sequence 3, Application US/10054680 



; Publication No. US20020132998A1 
; GENERAL INFORMATION: 
; APPLICANT: Friddle, Carl Johan 
; APPLICANT: Hilbun, Erin 

; TITLE OF INVENTION: No. US20020132998Alel Human Ion Exchanger 

Proteins and Polynucleotides Encoding the 

; TITLE OF INVENTION: Same 

; FILE REFERENCE: LEX-0301-USA 

; CURRENT APPLICATION NUMBER: US/10/054,680 

; CURRENT FILING DATE: 2002-01-22 

; PRIOR APPLICATION NUMBER: US 60/263,384 

; PRIOR FILING DATE: 2001-01-23 

; NUMBER OF SEQ ID NOS : 5 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 3 

LENGTH: 1863 

TYPE: DNA 
; ORGANISM: homo sapiens 
US-10-054-680-3 

Query Match 64.5%; Score 1784.6; DB 13; Length 1863; 

Best Local Similarity 98.5%; Pred. No. 0; 

Matches 1813; Conservative 0; Mismatches 24; Indels 4; 
Gaps 1; 

Qy 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 

ATGGCGTGGTTAAGGTTGCAGCCTCTCACCTCTGCCTTCCTCCATTTTGGGCTGGTTACC 60 
Qy 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 61 

TTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCCAAGC 120 
Qy 121 

AC AGGGCAGAACAAT GAGT C CT GT T CAGGGT CAT C GGACT GCAAGGAGGGT GT CAT C CT G 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 

ACAGGGCAGAACAATGAGTCCT GTTCAGGGTCAT CGGACT GCAAGGAGGGT GTCATCCTG 180 
Qy 181 

CCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTAT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 

C CAAT CT GGT ACC C GGAGAAC CCT T CC CTT GGGGACAAGAT T GCCAGGGT CAT T GT CT AT 240 
Qy 241 

TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 

. TTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCA 300 



Qy 301 

T CT ATT GAAGT CAT CACCT CTCAAGAGAGGGAGGT GACAATT AAGAAACCCAAT GGAGAA 360 

II I I I I I I I I I I I I I II I I I I 1 I 1 II I I I I I I I I I II I I II I I I I I I I I I I I I I I II I I I 
Db 301 

T CT ATT GAAGT CAT CACCT CTCAAGAGAGGGAGGTGACAATTAAGAAAC CCAAT GGAGAA 360 
Qy 361 

ACC AGC ACAACCACTATTCGGGT CT GGAAT GAAACT GT CT CCAAC CT GACC CTT AT GGCC 420 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 

AC CAGCACAACC ACT ATTCGGGT CT GGAAT GAAACT GT CT CCAACCTGACC CTT ATGGCC 420 
Qy 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 

CTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTC 480 
Qy 481 

ATT GCT GGT GATCT GGGACCTT CTAC CATT GT AGGGAGT GCAGC CT TCAAC ATGTTC ATC 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 481 

ATT GCT GGT GAT CT GGGACCTT CTAC CATT GT AGGGAGT GCAGC CTT CAACATGTT CATC 540 
Qy 541 

AT CATT GGCAT CT GT GT CT ACGT GAT C C CAGAC GGAGAGACT C GCAAGATCAAGCAT CTA 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 541 

AT CATTGGCAT CTGTGT CTACGT GATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTA 600 
Qy 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 

CGAGTCTTCTTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATT 660 
Qy 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 661 

CTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTC 720 
Qy 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 

TTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTGCTCTTCTACAAATAC 780 
Qy 781 

ATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGAC 840 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 781 

AT GCACAAAAAGT AC CGCACAGACAAACACC GAGGAATT AT C AT AGAGACAGAGGGT GAC 840 
Qy 841 

CAC CCTAAGGGC ATT GAGAT GGAT GGGAAAAT GAT GAATT CC CAT TTT CT AGAT GGGAAC 900 

I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 841 

CAC CCTAAGGGC AT T GAGAT GGAT GGGAAAAT GAT GAAT T CC CAT TTT CT AGAT GGGAAC 900 
Qy 901 

CTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATTCTC 960 

I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 901 

CTGGT GCCCCTGGAAGGGAAGGAAGTGGATGAGT CCCGCAGAGAGAT GATCCGGATT CTC 960 
Qy 961 

AAGGAT CT GAAG C AAAAACAC C CAGAGAAGGACT T AGAT CAGCT GGT GGAGAT GGC CAAT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 961 

AAGGATCTGAAGCAAAAACACCCAGAGAAGGACT T AGAT CAGCT GGT GGAGAT GGCCAAT 1020 
Qy 1021 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 

TACTATGCTCTTTCCCACCAACAGAAGAGCCGCGCCTTCTACCGTATCCAAGCCACTCGT 1080 
Qy 1081 

AT GAT GACT GGT GCAGGCAAT AT C CT GAAGAAACAT GCAGCAGAACAAGCCAAGAAGGCC 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1081 

AT GAT GACT GGT GCAGGCAAT AT C CT GAAGAAACAT GCAGCAGAACAAGCCAAGAAGGCC 1140 
Qy 1141 

T CCAGC AT GAGCGAGGT GCACAC CGAT GAGCCTGAGGACTTT ATTT CCAAGGT CTT CTTT 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 

T CCAGCATGAGC GAGGT GCACACC GAT GAGCCTGAGGACTTT ATT T CCAAGGT CTTCTTT 1200 
Qy 1201 

GACCCAT GTT CTT AC C AGT GC CT GGAGAACT GT GGGGCT GT ACTC CT GACAGT GGT GAGG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1201 

GACCCATGTTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGG 1260 
Qy 1261 

AAAGGGGGAGACAT GT CAAAGAC CAT GT ATGT GGACTACAAAACAGAGGAT GGT TCT GCC 1320 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1261 

AAAGGGGGAGACAT GT CAAAGAC CAT GT ATGT GGACTACAAAACAGAGGAT GGTTCT GCC 1320 
Qy 1321 



AAT GCAGGGGCT GACT AT GAGTT CACAGAGGGC AC GGT GGTT CT GAAGC C AGGAGAGACC 1380 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1321 

AAT GCAGGGGCT GACT AT GAGTTCACAGAGGGCACGGT GGTT CT GAAGCC AGGAGAGAC C 1380 
Qy 1381 

C AGAAGGAGTT CT CCGT GGGC ATAATT GAT GAC GACAT TTT T GAGGAGGAT GAACACTT C 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1381 

C AGAAGGAGTTCT C C GT GGGCAT AATT GAT GACGACATTTTTGAGGAGGAT GAACACTT C 1440 
Qy 1441 

TTTGTAAGGTT GAGCAATGT CCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 1500 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1441 

TTTGTAAGGTT GAGCAATGT CCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCTCCA 1500 
Qy 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 1501 

GCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACA 1560 
Qy 1561 

GT T AC CAT CTT GGAT GAT GACC AT GCAGGCAT CTT C ACT TTTGAAT GTGAT ACT ATT CAT 1620 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 1561 

GTT AC CAT CTT GGAT GAT GAC CAT GCAGGCAT CTT CACT TT T GAAT GT GAT ACT ATT CAT 1620 
Qy 1621 

GTCAGTGAGAGTATTGGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGT 1680 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1621 

GT CAGT GAGAGT AT T GGT GT TAT GGAGGT CAAGGTT CT GC GGACAT CAGGT GCCCGGGGT 1680 
Qy 1681 

ACAGT C AT CGT C CC CTTT AGGACAGT AGAAGGGACAGC CAAGGGT GGCGGT GAGGACTTT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 

ACAGT CAT CGT CC CCT TT AGGACAGT AGAAGGGACAGC CAAGGGT GGC GGT GAGGACTT T 1740 

Qy 1741 GAAG AC AC AT AT G GG GAGT T G GAAT T C AAGAAT GAT GAAAC 

T GT GAAAAC C AT AAG 1796 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1741 

GAAGACAC ATAT GGGGAGT T GGAAT T CAAGAAT GAT GAAACTGT AT GT GACAGACAGGAA 1800 

Qy 1797 GGTTAAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGA 1837 

III II I I I I I I I I I I I I I I I I I I 

Db 1801 GCT GAC TAT GGAAGAAGAGGAGGC CAAGAGGAT AGCAGAGA 1841 



RESULT 12 
US-10-281-866-1 

; Sequence 1, Application US/10281866 
; Publication No. US20030091570A1 
; GENERAL INFORMATION: 

; APPLICANT: Silos-Santiago, Inmaculada 

; TITLE OF INVENTION: Methods and compositions for the 

; TITLE OF INVENTION: treatment and diagnosis of pain disorders using 
46556 

; FILE REFERENCE: MPI01-272P1RM 

; CURRENT APPLICATION NUMBER: US/10/281, 866 

; CURRENT FILING DATE: 2002-10-28 

; PRIOR APPLICATION NUMBER: 60/335,078 

; PRIOR FILING DATE: 2001-10-31 

; NUMBER OF SEQ ID NOS : 3 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 1 

; LENGTH: 4282 

TYPE: DNA 
; ORGANISM: Homo sapien 
US-10-281-866-1 

Query Match 46.2%; Score 1277; DB 14; Length 4282; 

Best Local Similarity 69.1%; Pred. No. 0; 

Matches 1843; Conservative 0; Mismatches 775; Indels 51; 
Gaps 5; 

Qy 130 

AACAAT GAGT C CT GTT C AGGGT CAT C GGACT GCAAGGAGGGT GT CAT CCTGCCAATCT GG 189 

III I I III I II I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 208 

AGCACAGGGGGCTGCCAGGGGTCCTACCGCTGCCAGCCGGGGGTGCTGCTGCCCGTGTGG 267 
Qy 190 

TACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTATTTTGTGGCC 249 

I I I I I I I I I I I I I I I I I I I I I II I II I I I I I 

I II I I I I I I 

Db 268 

GAGCCCGACGACCCGTCGCTGGGTGACAAGGCGGCACGGGCAGTGGTGTACTTTGTGGCC 327 
Qy 250 

CTGATATACATGTTCCTTGGGGTGTCCATCATTGCTGACCGCTTCATGGCATCTATTGAA 309 

II I I I I I I I I I II II II I I I I I I I I I II I I I II I I I I I I I I I II 

I I 

Db 328 

AT GGT CT ACAT GTT TCT GGGAGT GT C CAT CAT CGCC GAC C GTTTCAT GGCGGCCATC GAG 387 
Qy 310 

GT CAT C AC CT CT CAAGAGAGGGAGGT GACAATTAAGAAAC CCAAT GGAGAAACCAGC ACA 369 

I I I I I I I I I I I II I I I I I I I I II I I I II I I I I I I I I I I I I I I 

Db 388 

GT CAT C AC GT CAAAAGAGAAGGAGAT CAC CAT CACCAAGGCCAACGGT GAGACC AGC GT G 447 
Qy 370 

ACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCC 429 



Db 448 

GGCACCGTTCGCATCTGGAATGAGACGGTGTCCAACCTCACGCTCATGGCCCTGGGCTCC 507 



Qy 430 

TCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGT 489 

i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II 

I I I 

Db 508 

TCCGCACCTGAGATCCTGCTGTCAGTCATCGAAGTCTGCGGCCACAACTTCCAGGCGGGT 567 
Qy 490 

GAT CT GGGACCT T CT ACCATT GT AGGGAGT GC AGCCTT CAACAT GT T CAT CAT CAT TGGC 549 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 568 

GAGCTGGGCCCAGGCACCATCGTGGGCAGCGCTGCCTTCAACATGTTTGTGGTCATCGCC 627 
Qy 550 

AT CT GTGTCTACGTGATCCCAGACGGAGAGACTCGCAAGAT CAAGCATCTACGAGTCTTC 609 
I II I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I II 

I I I I I I I I 

Db 628 

GTGTGCATCTACGTCATCCCAGCCGGCGAGAGCCGCAAGATCAAGCACCTGAGAGTCTTC 687 
Qy 610 

TTCATCACCGCTGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTC 669 

II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

I I 

Db 688 

TTTGTCACTGCCTCTTGGAGCATCTTCGCCTATGTCTGGCTTTATCTCATCCTTGCTGTT 747 
Qy 670 

TTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTG 729 

II I I I I I I I I I I I I I I I I I I I I I I I I I II II II II I I I I I I I II 

I I I 

Db 748 

TTTTCCCCCGGTGTGGTCCAGGTGTGGGAGGCGCTGCTGACCCTGGTCTTCTTCCCGGTG 807 
Qy 730 

T GT GTC CTT CT GGCCT GGGT GGCAGATAAAC GACT GCT CTT CT ACAAAT AC AT GCACAAA 789 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 808 

TGCGTGGTATTCGCCTGGATGGCCGACAAGCGGCTGCTCTTCTACAAGTACGTGTACAAG 867 
Qy 790 

AAGTACCGCACAGACAAACACCGAGGAATTATCATAGAGACAGAGGGTGACCACCCTAAG 849 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

I I I 

Db 868 

CGCTACCGCACCGACCCACGCAGCGGCATCATCATAGGCGCCGAGGGCGACCCCCCGAAG 927 
Qy 850 

GGCATTGAGATGGAT GGGAAAATGAT GAATT CCCATTTT CT AGATGGGAA 899 

I I II I II I I I I II I I I I III MINI 

Db 928 

AGCATCGAGCTGGACGGCACGTTCGTGGGCGCCGAGGCCCCAGGTGAGCTGGGCGGCCTG 987 
Qy 900 — 

C CT GGT GC CC CT GGAAGGGAAGGAAGT GGAT GAGT CC C GCAGAGAGATGAT C CGGATT 957 

I I I I III I I I I II I I II I I I II I III I MM 

1 1 1 



Db 988 

GGCCCGGGCCCCGCCGAGGCGCGCGAGCTGGACGCCAGCCGCCGCGAGGTCATCCAGATC 1047 
Qy 958 

CT CAAGGAT CTGAAGCAAAAACAC CCAGAGAAGGACTT AGAT CAGCT GGT GGAGAT GGCC 1017 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I I 

Db 1048 

CT CAAGGACCTCAAGCAGAAGCAC C C GGACAAGGATCT GGAGCAGCT GGT GGGC AT CGCC 1107 
Qy 1018 

AATT ACT AT GCT CTTT C CCACCAACAGAAGAGCCGCGC CTT CTAC C GT ATCCAAGCCACT 1077 

II I I I I I II II I I I I I I II I II I I I I I I II I I I I I I I I I I I I I I 

I I I I I 

Db 1108 

AACTACTACGCGCTGCTGCACCAGCAGAAGAGCCGCGCCTTCTACCGCATCCAGGCCACG 1167 
Qy 1078 

C GT AT GATGACT GGT GCAGGCAAT AT CCTGAAGAAAC AT GCAGCAGAACAAGCCAAGAAG 1137 

II I I I I I I I I I I I I I I I I II I I III II II II I 

I 

Db 1168 

CGGCTGATGACCGGCGCCGGGAACGTGCTGCGCAGACACGCGGCGGACGCCTCGCGCAGG 1227 
Qy 1138 

GCCTC C AGCAT GAGCGAGGT GCACAC C GAT GAGC CT GAGGACTT T ATTT CCAAGGT CTT C 1197 

III I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1228 GCGGC 

GCCGGCCGAGGGCGCGGGCGAGGACGAAGACGACGGCGCCAGCCGCATCTTC 1284 
Qy 1198 

TTT GACCCAT GTT CTT AC C AGTGC CT GGAGAACT GTGGGGCTGT ACT CCTGACAGT GGT G 1257 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

I 

Db 1285 

TTCGAGCCTAGCCTCTACCACTGCCTGGAGAACTGCGGCTCCGTGCTGCTGTCCGTCACG 1344 
Qy 1258 

AGGAAAGGGGGAGACAT GT CAAAGAC CAT GT AT GT GGACT ACAAAAC AGAGGAT GGTT CT 1317 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1345 

TGCCAGGGCGGCGAGGGCAACAGCACCTTCTACGTGGACTACCGCACTGAGGACGGCTCT 1404 
Qy 1318 

GC CAAT GCAGGGGCT GACT AT GAGTT C AC AGAGGGCAC GGT GGT T CT GAAGCCAGGAGAG 1377 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 1405 

GCCAAGGCGGGCTCCGACTACGAGTACAGCGAGGGCACGCTGGTGTTCAAACCAGGCGAG 1464 
Qy 1378 

ACCCAGAAGGAGTTCT CCGTGGGCATAATTGAT GACGACATTTTT GAGGAGGATGAACAC 1437 

II I I I I I I I I I I I I I I I I I II II I I I I I I I I II I I I I I I I I II 

I I 

Db 1465 

ACGCAGAAGGAGCT GC GC AT C GGC AT CAT C GACGACGAC AT CTT C GAGGAGGACGAGC AT 1524 



Qy 1438 



TTCTTTGTAAGGTTGAGCAATGTCCGCATAGAGGAGGAGCAGCCAGAGGAGGGGATGCCT 1497 

I I I I I I I I I I I II I I I I I I I I I I I I I I II I 

Db 1525 

TTCTTCGTGCGGCTGCTGAACCTGCGCGTGGGCGACGCGCAGGGCATGTTCGAGCCG 1581 



Qy 1498 

CCAGCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCC 1557 

III III II III III 

I I I I I 

Db 1582 

GACGGCGGCGGGCGGCCCAAGGGGCGGCTGGTGGCGCCGCTGCTGGCC 1629 



Qy 1558 

ACAGTT ACCAT CTT GGAT GAT GACCATGCAGGCAT CTT CACT TTT GAAT GTGAT ACT ATT 1617 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1630 

ACCGTC ACC ATC CT GGAC GACGAC CACGCAGGCAT CTT CT CCTT C C AGGAC CGC CT GCT G 1689 



Qy 1618 

CAT GT C AGT GAGAGT ATT GGT GTT AT GGAGGT CAAGGTT CT GCGGACAT CAGGT GC CCGG 1677 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

I I 

Db 1690 

CACGTGAGCGAGTGCATGGGCACCGTGGACGTGCGCGTCGTGCGCAGCTCGGGCGCGCGC 1749 



Qy 1678 

GGT AC AGT CATC GT C CC CTTT AGGAC AGT AGAAGGGACAGCCAAGGGT GGC GGT GAGGAC 1737 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

I I 

Db 1750 

GGCACCGTGCGCCTTCCCTACCGCACGGTGGACGGCACGGCGCGCGGCGGCGGCGTGCAC 1809 



Qy 1738 

TTT GAAGACAC AT AT GGGGAGTT GGAATT CAAGAAT GAT GAAACT GT GAAAAC C ATAAGG 1797 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

I 

Db 1810 

T AC GAGGAC GC GT GCGGAGAGCT GGAGT TT GGC GACGAC GAGAC CAT GAAAACT CTT CAG 1869 



Qy 1798 

GTTAAAAT AGT AGAT GAGGAGGAAT ACGAAAGGCAAGAGAATTT CTT CATT GCCCTTGGT 1857 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 

I I 

Db 1870 

GT GAAGATAGTT GAT GAC GAGGAAT ATGAGAAAAAGGAT AATT TCT T CATT GAGCT GGGC 1929 



Qy 1858 GAACCGAAATGGATGGAACGTGGAATATCAGATGTGACA 

GAC 1899 

III I I I I I I I I II I I I I II I I I I 

I I I 

Db 1930 

CAGCCCCAGTGGCTTAAGCGAGGGATTTCAGCTCTGCTACTCAATCAAGGGGATGGGGAC 1989 



Qy 1900 

AGGAAGCT GACT AT GGAAGAAGAGGAGGCCAAGAGGAT AGCAGAGAT GGGAAAGC CAGTA 1959 
I I I I I I I I II II II I II I I II I I I I I I I I I I I I I I I I I I I 

I I I II I I I 

Db 1990 

AGGAAGCT AACAGC CGAGGAGGAG GAGGCTC GGAGGAT AGCAGAGAT GGGCAAGCCAGTT 2049 



Qy 1960 

TT GGGT GAACACC CCAAACT AGAAGT CAT CATT GAAGAGT CCT AT GAGTTCAAGACT ACG 2019 

I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

I I I 

Db 2050 

CT T GGGGAGAACT GCC GGCT GGAGGT CAT CAT C GAGGAGTC AT AT GAT TTTAAGAACACG 2109 
Qy 2020 

GT GGACAAACT GAT CAAGAAGACAAAC CT GGCCT T GGTT GT GGGGACCCAT T CCT GGAGG 2079 

I I I I I I I II I I I I I I I I I II III I I I I I I I I I I I I I I II I I I I I I 

I I I I I I 

Db 2110 

GT GGATAAACT CAT CAAGAAAAC GAACT T GGCCTT GGTAATT GGGACC CAT T CAT GGAGG 2169 

Qy 2080 GAC CAGTT CAT GGAGGC C AT CAC CGT CAGTGCAGC 

AGGGGAT GAG GAT GAGGAT GAA 2136 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2170 

GAGCAGT T TT T AGAGGCAAT T ACGGT GAGC GCAGGGGAC GAGGAGGAGGAGGAGGACGGG 2229 
Qy 2137 

TCCGGGGAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTC 2196 

III I I I I II I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I II I II 

III 

Db 2230 

TCCCGGGAGGAGCGGCTGCCGTCGTGCTTTGACTACGTGATGCACTTCCTGACGGTGTTC 2289 
Qy 2197 

TGGAAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTC 2256 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2290 

TGGAAGGTGCTCTTCGCCTGTGTGCCCCCCACCGAGTACTGCCACGGCTGGGCCTGCTTT 2349 
Qy 2257 

GCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCAC 2316 

I I I I I I I I I I I I I I I I III I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2350 

GGTGTCTCCATCCTGGTCATCGGCCTGCTCACCGCCCTCATTGGGGACCTCGCCTCCCAC 2409 
Qy 2317 

T T C GGCT GCACCATT GGTCT CAAAGAT T C AGT CACAGCT GTT GTTTTC GTGGCAT TTGGC 2376 

I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I II I I I II I 

I I I 

Db 2410 

TTCGGCTGCACCGTTGGCCTCAAGGACTCTGTCAATGCTGTTGTCTTCGTTGCCCTGGGC 2469 
Qy 2377 

ACCT CT GT CC C AGATAC GTT TGC CAGCAAAGCT GCT GC C CT C C AGGAT GTAT AT GCAGAC 2436 



Db 2470 

ACCTCCATCCCTGACACGTTCGCCAGCAAGGTGGCGGCGCTGCAGGACCAGTGCGCCGAC 2529 
Qy 2437 

GCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTG 2496 

II I I I I I I I I I I I I I I I I III I I I I I I II II II I I I I I III I III 

I 



Db 2530 

GCGTCCATCGGCAACGTGACCGGCTCCAACGCGGTGAACGTGTTCCTTGGCCTGGGCGTC 2589 



Qy 2497 

GCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCC 2556 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II 

I 

Db 2590 

GCCTGGTCTGTGGCCGCCGTGTACTGGGCGGTGCAGGGCCGCCCCTTCGAGGTGCGCACT 2649 
Qy 2557 

GGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTG 2616 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I 

I I I I 

Db 2650 

GGCACGCTGGCCTTCTCCGTCACGCTCTTCACCGTCTTCGCCTTCGTGGGCATTGCCGTG 2709 
Qy 2617 

CTCTTGTACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAG 2676 



Db 2710 

CTGCTGTACCGGCGCCGGCCGCACATCGGCGGCGAGCTGGGCGGCCCGCGCGGACCCAAG 2769 
Qy 2677 

CTCGCCACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTA 2736 

I I I II I I I II I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2770 

CTCGCCACCACCGCGCTCTTCCTGGGCCTCTGGCTCCTGTACATCCTCTTCGCCAGCCTG 2829 

Qy 2737 GAGGCCTATTGCTACATCAAGGGGTTCTA 2765 

I I I I I I I I I I I I I I I I I I I I I I I 
Db 2830 GAGGCGTACTGCCACATCCGGGGCTTCTA 2858 



RESULT 13 
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; Publication No. US20030091570A1 
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LENGTH: 4282 

TYPE: DNA 
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Query Match 46.2%; Score 1277; DB 14; Length 4282; 

Best Local Similarity 69.1%; Pred. No. 0; 

Matches 1843; Conservative 0; Mismatches 775; Indels 51; 
Gaps 5; 

Qy 130 

AACAATGAGTCCTGTTCAGGGTCATCGGACTGCAAGGAGGGTGTCATCCTGCCAATCTGG 189 

III II III I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 208 

AGCACAGGGGGCTGCCAGGGGTCCTACCGCTGCCAGCCGGGGGTGCTGCTGCCCGTGTGG 2 67 
Qy 190 

TACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTATTTTGTGGCC 249 

I I I I I I I I I I I I I I I I I I II I II III I I I I I 

II I I I I I I I 

Db 268 

GAGCCCGACGACCCGTCGCTGGGTGACAAGGCGGCACGGGCAGTGGTGTACTTTGTGGCC 327 
Qy 250 

CT GAT AT ACAT GT T C CTT GGGGT GT C CAT CATT GCT GACCGCTTCAT GGCATCT ATT GAA 309 

II I I I II I I I I II II I I I I I I I I I II M I I I I I I I I I I II I I II 

I I 

Db 328 

ATGGTCTACATGTTTCTGGGAGTGTCCATCATCGCCGACCGTTTCATGGCGGCCATCGAG 387 
Qy 310 

GTCATCACCT CTCAAGAGAGGGAGGT GACAATTAAGAAACCCAATGGAGAAACCAGCACA 369 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 388 

GT CAT CACGT CAAAAGAGAAGGAGAT C AC CAT CACCAAGGCCAAC GGT GAGAC C AGCGT G 447 
Qy 370 

ACCACTATTCGGGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCC 429 

III MM II I I I I II M II M I I I II II I II II II I I I II I M I 

I I I 

Db 448 

GGCACCGTTCGCATCTGGAATGAGACGGTGTCCAACCTCACGCTCATGGCCCTGGGCTCC 507 
Qy 430 

T CT GCT CCT GAGATACT CCT CT CTT TAATTGAGGTGTGT GGT CAT GGGTT CATT GCT GGT 489 

II I I I I I I I M I I I I I I I I I I II I I II II I I I II II 

I I I 

Db 508 

T CCGCACCT GAGAT C CT GCT GT CAGT CAT CGAAGT CT GCGGCC ACAACT T CCAGGC GGGT 5 67 
Qy 490 

GAT CTGGGACCTT CT AC CATT GT AGGGAGTGC AGC CTTCAACAT GTT CAT CAT CATT GGC 549 

II I I II I I I II II I II I I I I II I I II II I I I I II I I I I I I I I 

I 

Db 568 

GAGCTGGGCCCAGGCACCATCGTGGGCAGCGCTGCCTTCAACATGTTTGTGGTCATCGCC 627 
Qy 550 

ATCT GT GTCTACGT GATCCCAGACGGAGAGACTCGCAAGATCAAGCATCTACGAGTCTTC 609 

I II I M M II II II I II Ml II II I I II I M II I II I I M 

I I I I II I I 

Db 628 

GT GT GCATCT AC GT CAT C CC AGCC GGCGAGAGC C GCAAGAT CAAGCAC CT GAGAGT CT T C 687 



Qy 610 

T T CATCAC CGCT GCTT GGAGT AT CT T T GC CT AC AT CT GGCT CT ATAT GATTCTGGCAGT C 669 

II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 688 

TTTGTCACTGCCTCTTGGAGCATCTTCGCCTATGTCTGGCTTTATCTCATCCTTGCTGTT 747 
Qy 670 

TTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTG 729 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

I I I 

Db 748 

TTTTCCCCCGGTGTGGTCCAGGTGTGGGAGGCGCTGCTGACCCTGGTCTTCTTCCCGGTG 807 
Qy 730 

T GT GT CCT T CTGGCCT GGGTGGCAGATAAAC GACT GCT CT T CT ACAAAT ACAT GCACAAA 789 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I II 

Db 808 

T GC GT GGT ATTC GCCT GGATGGCCGACAAGCGGCT GCT CT T CT ACAAGT ACGT GTACAAG 8 67 
Qy 790 

AAGT ACC GCACAGACAAAC AC CGAGGAAT TAT C AT AGAGACAGAGGGT GACCACCCTAAG 849 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

I I I 

Db 868 

C GCT AC C GCAC CGAC C CACGCAGC GGC AT CAT C AT AGGCGC C GAGGGCGACCCCCC GAAG 927 
Qy 850 

GGCAT T GAGAT GGAT GGGAAAAT GAT GAATT C CCATTT T CT AGAT GGGAA 899 

I I I I I I I I I I I I I I I II III I I I I I I 

Db 928 

AGCATCGAGCTGGACGGCACGTTCGTGGGCGCCGAGGCCCCAGGTGAGCTGGGCGGCCTG 987 
Qy 900 — 

CCTGGTGCCCCTGGAAGGGAAGGAAGTGGATGAGTCCCGCAGAGAGATGATCCGGATT 957 

MM III I I I I II I I II I I II I I I I I I M II 

I I I 

Db 988 

GGCCCGGGCCCCGCCGAGGCGCGCGAGCTGGACGCCAGCCGCCGCGAGGTCATCCAGATC 1047 
Qy 958 

CT CAAGGAT CTGAAGCAAAAAC AC C CAGAGAAGGACTT AGATCAGCTGGT GGAGAT GGC C 1017 

II II I I I I I I I I I I I I I I II I I II II I II I I I II I I II I I I I M 

I I I 

Db 1048 

CT CAAGGACCTCAAGCAGAAGCACCCGGACAAGGATCT GGAGCAGCTGGTGGGCATCGCC 1107 
Qy 1018 

AAT T ACT AT GCT CTT T CC C ACCAACAGAAGAGC CGCGCCTT CT ACCGTAT CCAAGC C ACT 1077 

II I I I I I I I II Mill II I I I II I II I I II II II II M I II II I 

II I I I 

Db 1108 

AACT ACT AC GCGCTGCT GC AC CAGCAGAAGAGC C GCGC CTT CT ACC GCAT CCAGGCCACG 1167 
Qy 1078 

C GT AT GAT GACT GGT GCAGGCAAT AT CCT GAAGAAAC AT GCAGCAGAACAAGC CAAGAAG 1137 

II II I II I I I I M II I I MM I I I I II II II I I 



Db 1168 

CGGCTGATGACCGGCGCCGGGAACGTGCTGCGCAGACACGCGGCGGACGCCTCGCGCAGG 1227 



Qy 1138 

GC CT C CAGC AT GAGCGAGGT GCACACCGAT GAGC CT GAGGACTTT ATTT CCAAGGTCT TC 1197 
III I I I I I I I I I I I I I I I I I 

I I II I 

Db 1228 GCGGC 

GCCGGCCGAGGGCGCGGGCGAGGACGAAGACGACGGCGCCAGCCGCATCTTC 1284 
Qy 1198 

T T T GAC C CAT GT T CTT AC C AGT G C CT GGAGAACT GT GGGGCT GT ACT C CT GACAGT G GT G 1257 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1285 

TTCGAGCCTAGCCTCTACCACTGCCTGGAGAACTGCGGCTCCGTGCTGCTGTCCGTCACG 1344 
Qy 1258 

AGGAAAGGGGGAGACAT GT CAAAGAC CAT GT AT GT GGACT ACAAAACAGAGGAT GGTT CT 1317 



Db 1345 

TGCCAGGGCGGCGAGGGCAACAGCACCTTCTACGTGGACTACCGCACTGAGGACGGCTCT 1404 
Qy 1318 

GCCAAT GCAGGGGCTGACT ATGAGT T CACAGAGGGC AC GGT GGTT CTGAAGCCAGGAGAG 1377 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

I I I 

Db 1405 

GCCAAGGCGGGCTCCGACTACGAGTACAGCGAGGGCACGCTGGTGTTCAAACCAGGCGAG 1464 
Qy 1378 

AC C CAGAAGGAGTT CT C CGT GGGCATAAT TGAT GAC GAC AT T T T T GAGGAGGAT GAAC AC 1437 

II I I I I I I I I I I I I I I I I I II II I I I I I I I I II I I I I I I I I II 

I I 

Db 1465 

ACGCAGAAGGAGCT GCGCATCGGCAT CATC GAC GAC GACAT CTT CGAGGAGGACGAGCAT 1524 
Qy 1438 

TT CT TT GTAAGGTT GAGCAAT GT CC GCAT AGAGGAGGAGCAGCCAGAGGAGGGGATGCCT 1497 

I I I I I I I I I I I II I I I I I I I I I I I I I I II 

Db 1525 

TTCTTCGTGCGGCTGCTGAACCTGCGCGTGGGCGACGCGCAGGGCATGTTCGAGCCG 1581 

Qy 1498 

CCAGCAATATTCAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCC 1557 



Db 1582 

GACGGCGGCGGGCGGCCCAAGGGGCGGCTGGTGGCGCCGCTGCTGGCC 1629 
Qy 1558 

AC AGT T AC CAT CTT GGATGATGAC C AT GCAGGCAT CT T C ACTTTT GAAT GT GAT ACT ATT 1617 
II II I I I I I I I II I II I I I I I I I I I I I I I I I I I I II I 

I 

Db 1630 

AC CGT CACCAT CCT GGACGACGAC CACGCAGGCAT CTT CT CCTTC CAGGAC CGCCT GCT G 1689 
Qy 1618 

CAT GTCAGTGAGAGTATTGGTGTTAT GGAGGT CAAGGTTCTGCGGACATCAGGT GCCCGG 1677 



1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I 1 1 1 1 1 1 

1 1 

Db 1690 

CACGTGAGCGAGTGCATGGGCACCGTGGACGTGCGCGTCGTGCGCAGCTCGGGCGCGCGC 1749 
Qy 1678 

GGT ACAGT CAT CGT C C CCTTT AGGACAGT AGAAGGGAC AGC CAAGGGT GGCGGT GAGGAC 1737 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1750 

GGCACCGTGCGCCTTCCCTACCGCACGGTGGACGGCACGGCGCGCGGCGGCGGCGTGCAC 1809 
Qy 1738 

TTT GAAGACACATAT GGGGAGTT GGAATT CAAGAAT GAT GAAACT GT GAAAACCATAAGG 1797 

I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I 

I 

Db 1810 

TAC GAGGACGC GT GC GGAGAGCT GGAGTTTGGCGAC GAC GAGAC CAT GAAAACTCTT CAG 1869 
Qy 1798 

GTTAAAAT AGT AGAT GAGGAGGAAT AC GAAAGGCAAGAGAAT TT CTT CAT TGC CCTT GGT 1857 

I I I I I I'l I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 

I I 

Db 1870 

GT GAAGAT AGTT GAT GACGAGGAAT AT GAGAAAAAGGATAAT TT CTT CAT TGAGCT GGGC 1929 



Qy 1858 GAACCGAAAT GGATGGAACGTGGAAT ATCAGATGT GACA 

GAC 1899 

III I I II I I I I I I I I I I I I I I I I 

I I I 

Db 1930 

CAGCCCCAGTGGCTTAAGCGAGGGATTTCAGCTCTGCTACTCAATCAAGGGGATGGGGAC 1989 
Qy 1900 

AGGAAGCT GACTAT GGAAGAAGAGGAGGC CAAGAGGAT AGCAGAGAT GGGAAAGC C AGTA 1959 
I I I I I I I I II II II I I I I I I I I II I I II I I I I I I I I I I I I 

I I I I I I I I 

Db 1990 

AGGAAGCTAACAGC C GAGGAGGAGGAGGCTCGGAGGAT AGCAGAGAT GGGCAAGC CAGTT 2049 
Qy 1960 

TT GGGT GAAC ACCCCAAACTAGAAGT CAT CATT GAAGAGT C CT AT GAGT T CAAGACT ACG 2019 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2050 

CTT GGGGAGAACT GC C GGCTGGAGGT CAT C AT CGAGGAGT CAT AT GATTT T AAGAAC ACG 2109 
Qy 2020 

GT GGACAAACT GAT CAAGAAGACAAACCT GGC CTT GGTT GT GGGGAC CCATT CCT GGAGG 2079 

I I I I I I I I I I I I I I II I I II I I I I I I I I II I I I I I I I I I I I I I I I 

MINI 

Db 2110 

GT GGATAAACT CAT CAAGAAAAC GAACTT GGCCT T GGTAATT GGGAC CCATT CAT GGAGG 2169 

Qy 2080 GACC AGTT CATGGAGGCC AT C AC CGT CAGT GCAGC 

AGGGGAT GAGGAT GAGGAT GAA 2136 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 2170 

GAGCAGT T T TT AGAGGCAATT ACGGT GAGCGCAGGG GAC GAGGAGGAGGAGGAGGAC GGG 2229 



Qy 2137 

TCCGGGGAGGAGAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTC 2196 

III I II I I I I I I I I I I I I II I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I II 

I I I 

Db 2230 

TCCCGGGAGGAGCGGCTGCCGTCGTGCTTTGACTACGTGATGCACTTCCTGACGGTGTTC 2289 



Qy 2197 

TGGAAGGTGCTGTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTC 2256 

I I I I I I I I I I I II II I I I I I I I I I II I I I I 
I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2290 

TGGAAGGTGCTCTTCGCCTGTGTGCCCCCCACCGAGTACTGCCACGGCTGGGCCTGCTTT 2349 



Qy 2257 

GCCGTCTCCATCCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCAC 2316 

I I I I I I I I I I I I I I I I III I I I I I I I I I II I I I I I I II I I I I I I I I I I 

I I I 

Db 2350 

GGTGTCTCCATCCTGGTCATCGGCCTGCTCACCGCCCTCATTGGGGACCTCGCCTCCCAC 2409 



Qy 2317 

TTCGGCTGCACCATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGC 2376 

I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I II I II I 

I I I 

Db 2410 

TTCGGCTGCACCGTTGGCCTCAAGGACTCTGTCAATGCTGTTGTCTTCGTTGCCCTGGGC 24 69 
Qy 2377 

ACCT CT GT C C CAGAT AC GTTT GCCAGCAAAGCT GCTGC C CT CCAGGATGT ATAT GCAGAC 2436 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II 

I I I 

Db 2470 

AC CT C CAT CCCT GACAC GT TC GC CAGCAAGGTGGCGGC GCT GCAGGACCAGT GCGC C GAC 2529 



Qy 2437 

GCCTCCATTGGCAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTG 2496 

II I I I I I I I I I I I I I I I I III I I I I I I II II II I I I I I III I III 

I 

Db 2530 

GCGTCCATCGGCAACGTGACCGGCTCCAACGCGGTGAACGTGTTCCTTGGCCTGGGCGTC 2589 



Qy 2497 

GCCTGGTCCGTGGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCC 2556 
I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 2590 

GCCTGGTCTGTGGCCGCCGTGTACTGGGCGGTGCAGGGCCGCCCCTTCGAGGTGCGCACT 2649 



Qy 2557 

GGCACACTGGCCTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTG 2616 

I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I II II II I I I I 

I I I I 

Db 2650 

GGCACGCTGGCCTTCTCCGTCACGCTCTTCACCGTCTTCGCCTTCGTGGGCATTGCCGTG 2709 



Qy 2617 

CTCTTGTACCGAAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAG 2676 

II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

I I I I 



Db 2710 

CTGCTGTACCGGCGCCGGCCGCACATCGGCGGCGAGCTGGGCGGCCCGCGCGGACCCAAG 2769 
Qy 2677 

CTCGCCACAACATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTA 2736 

I I I I I I I I II I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I 

I I 

Db 2770 

CTCGCCACCACCGCGCTCTTCCTGGGCCTCTGGCTCCTGTACATCCTCTTCGCCAGCCTG 2829 

Qy 2737 GAGGCCTATT GCTACAT CAAGGGGTTCTA 2765 

I I I I I I I I I I I I I I I I I I I I I I I 
Db 2830 GAGGCGTACTGCCACATCCGGGGCTTCTA 2858 



RESULT 14 
US-10-388-934-506 

; Sequence 506, Application US/10388934 

; Publication No. US20040005547A1 

; GENERAL INFORMATION: 

; APPLICANT: Boess, Franziska 

; APPLICANT: Suter-Dick, Laura 

; APPLICANT: Wolf, Detlef 

; TITLE OF INVENTION: BIOMARKERS AND EXPRESSION PROFILES FOR TOXICOLOGY 
; FILE REFERENCE: 21199 

; CURRENT APPLICATION NUMBER: US/ 10/38 8 , 934 

; CURRENT FILING DATE: 2003-03-14 

; PRIOR APPLICATION NUMBER: 02005336.9 

; PRIOR FILING DATE: 2002-03-14 

; PRIOR APPLICATION NUMBER: 02015657.6 

; PRIOR FILING DATE: 2002-07-17 

; NUMBER OF SEQ ID NOS : 862 

; SOFTWARE: Patentln version 3.1 

; SEQ ID NO 506 

LENGTH: 3004 

TYPE: DNA 

ORGANISM: Rattus norvegicus (No. US20040005547Alway rat) 
US-10-388-934-506 

Query Match 45.9%; Score 1270.8; DB 16; Length 3004; 

Best Local Similarity 69.0%; Pred. No. 0; 

Matches 1835; Conservative 0; Mismatches 772; Indels 51; 
Gaps 5; 

Qy 141 

CT GTTCAGGGT CAT CGGACT GCAAGGAGGGT GT CAT C CT GCCAAT CT GGT ACC C GGAGAA 200 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 126 

CTGCCAAGGTTCCTACCGCTGCCAACCAGGGGTGCTGCTGCCTGTGTGGGAACCCGACGA 185 
Qy 201 

CCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGTCTATTTTGTGGCCCTGATATACAT 260 

I I I I I I I I I I II I I I I Ml I I I I I I I I I I I I I I II I I I I I 

I I I I I 

Db 186 

CCCATCACTGGGGGACAAGGCTGCACGGGCCGTGGTGTACTTTGTGGCCATGGTCTACAT 245 



Qy 261 

GT T C CT T GGGGT GT C CAT CATT GCT GAC C GCTT CAT GGCATCT AT T GAAGT CAT CAC CT C 320 



1 1 1 1 I II 1 1 1 1 I II 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 

Db 246 

GT T CT T GGGT CT GT CT ATCATT GCT GAT CGTT T T AT GGCAT CCATT GAGGT CAT CACATC 305 

Qy 321 

T CAAGAGAGGGAGGTGACAATTAAGAAAC CCAAT GGAGAAACC AGCACAACCACT ATT C G 380 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

I I 

Db 306 

CAAGGAGAAAGAGATCACCATCACCAAGGCAAATGGGGAGACCAGCGTGGGCACTGTACG 365 
Qy 381 

GGTCTGGAATGAAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCCTCTGCTCCTGA 440 

I I I I I I I II I I I I II I I I I I I I I II II I I I I I I I I I I I I I I I I II 

I I I I I 

Db 366 

CATCTGGAATGAAACGGTGTCCAACCTTACACTCATGGCCCTGGGCTCCTCAGCACCTGA 425 
Qy 441 

GATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGATCTGGGACC 500 

III II II III I I I I I I I I I I I I I II II I II I I II I II II 

I I 

Db 426 

GATT CT GCT GT CT GTCAT C GAGGT CT GT GGC CACAACTT CCAGGC GGGT GAGCTAGGCCC 485 
Qy 501 

TTCTACCATT GTAGGGAGT GCAGCCTTCAACATGTT CAT CATCATTGGCATCTGTGTCTA 560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

I I 

Db 486 

AGGCACCATCGTGGGCAGTGCCGCCTTCAACATGTTTGTGGTCATTGCTGTGTGTGTGTA 545 
Qy 561 

CGT GAT CC CAGAC GGAGAGACT C GCAAGAT CAAGC AT CT AC GAGTCTT CTT CAT C AC C GC 62 0 

I I I II I I I I I I II I I I I I I I I I I I I II I I I II I I I I I I I I I I I 

I I 

Db 546 

T GT CAT C CCGGCT GGTGAGAGCC GTAAGAT CAAGCACTTAAGGGT CTT CTT C GT CACAGC 605 
Qy 621 

TGCTTGGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGG 680 

II I I II I I I I I I I I I I I I I I I I I I I III I I I I I I Mill I I I I I II 

I I 

Db 606 

CTCTTGGAGCATCTTTGCCTATGTCTGGCTTTATCTCATTCTAGCAGTTTTCTCTCCAGG 665 
Qy 681 

TGTGGTCCAGGTTTGGGAAGGCCTCCTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCT 740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 666 

CGTGGTCCAGGTGTGGGAGGCACTGCTCACGCTGGTCTTCTTCCCGGTGTGTGTGGTGTT 725 
Qy 741 

GGCCT GGGT GGCAGATAAAC GACT GCT CT T CT ACAAAT ACAT GCACAAAAAGTACC GCAC 800 

II I II I I I I I II II I I I I I I I I I I I I I I I I I III II I I II II 

I I I I I 

Db 726 

CGCCTGGATGGCGGACAAGCGACTGCTCTTCTACAAGTACGTGTACAAGCGCTATCGCAC 785 



Qy 801 

AGACAAAC AC CGAGGAATT AT C AT AGAGACAGAGGGT GACCAC CCTAAGGGCATT GAGAT 860 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 786 

C GACCCT C GCAGCGGAAT CAT CATC GGGGCAGAGGGAGACCC GCCCAAGAGCAT CGAGCT 845 

Qy 861 GGATGGGAAAAT GAT GAAT T C CCAT TT T CT AGAT GGGAA 

CCTGGTGCC 908 

I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 846 

GGAT GGCACATT C GTGGGCACT GAGGT C CCAGGCGAGCT GGGT GC ATT GGGCACAGGT CC 905 



Qy 909 

C CT GGAAGGGAAGGAAGTGGAT GAGT C C CGCAGAGAGAT GAT C CGGATT CT CAAGGAT CT 968 

I I I I I II I I I I I III I I I I I I I I I II I I I I I I I I I 

I 

Db 906 

CGCTGAGGCGCGTGAGCTGGACGCCAGCCGGCGCGAGGTCATCCAGATTCTTAAGGACTT 965 



Qy 969 

GAAGCAAAAAC AC C C AGAGAAGGACT TAGAT CAGCT GGT GGAGAT GGC CAAT T ACTAT GC 1028 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I 

Db 966 

AAAGCAGAAGCAC C CGGATAAGGACCTGGAGCAGCT GGT GGGCAT CGC CAAGTACT AT GC 1025 



Qy 1029 

T CTT T C C CACCAACAGAAGAGC C GCGCCTT CT ACCGT AT CCAAGCCACT CGT AT GAT GAC 1088 
II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

I I II I I I 

Db 1026 

ACT GCT GC ACC AGCAGAAGAGCC GT GC CT TCT AC CGCAT C CAGGC C AC GCGGCT GAT GAC 1085 
Qy 1089 

T GGT GCAGGCAAT ATC CTGAAGAAACAT GCAGCAGAACAAGC CAAGAAGGC CTC CAGC AT 1148 
I I I II I I I I I I I I I I I I I I I I I I III II 

Db 1086 AGGTGCGGGCAACGTGCTGCGCCGACACGCTGCGGATGCTGCCCGCAGG 

CCTGGGGC 1142 



Qy 1149 

GAGCGAGGTGCACACCGAT GAGCCT GAGGACTTTATTTCCAAGGTCTT CTTT GAC CCAT G 1208 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1143 

CAACGAT GGT GCCCCC GAT GAT GAGGACGAT GGT GCCAGCCGCAT CT T CTT T GAGCC CAG 1202 
Qy 1209 

TTCTTACCAGTGCCTGGAGAACTGTGGGGCTGTACTCCTGACAGTGGTGAGGAAAGGGGG 1268 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

I I 

Db 1203 

CCTCTACCACTGCCTGGAGAACTGC GGGT CAGT GCT GCT GTCGGTGGCTTGCCAGGGTGG 1262 



Qy 1269 

AGACAT GT CAAAGACCATGT AT GT GGACT ACAAAACAGAGGAT GGTTCT GCCAAT GCAGG 1328 

II I I I I I I I I II I I I I I I I II I I II I I I I I I I I I 

I I I I I 



Db 1263 

T GAGGGCAACAGC AC CTT CTAC GT GGATT ACCGC AC GGAGGATGGCTCT GCAAAGGCAGG 1322 



Qy 1329 

GGCTGACT AT GAGTT CACAGAGGGCACGGTGGTT CT GAAGCCAGGAGAGACCCAGAAGGA 1388 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I 

Db 1323 

CT C CGATT AT GAGT ACAGC GAGGGC AC GCT GGT GTT CAAGC CCGGGGAGACGC AGAAGGA 1382 
Qy 1389 

GTT CT CCGT GGGCATAATT GAT GACGACATTTTT GAGGAGGAT GAACACTT CTTT GTAAG 1448 

I I I I I I I I I II II I I I I I I I I II I II I II I I II I I I I I I I I II 

I I 

Db 1383 

GCT GC GCAT CGGCAT CAT C GAC GACGAC ATCT T C GAGGAGGACGAGC ACTT CT T CGT GAG 1442 
Qy 1449 

GTT GAGCAAT GT C C GC AT AGAGGAGGAGCAGCC AGAGGAGGGGAT GC CT CCAGCAAT ATT 1508 
III I I I I I I I I I I I I I I II 

Db 1443 GCTGCTGAACCTGCGTGTGGGCGATGCTCAGGGCATGTTCGAG 

CC 1487 

Qy 1509 

CAACAGTCTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCAT 1568 

I I I I I II I II III III I I I I I I I I I 

II I I I 

Db 1488 

CGACGGCGGTGGGCGGCCCAAGGGGCGGCTGGTGGCGCCACTGCTGGCCACCGTCACCAT 1547 
Qy 1569 

CTT GGAT GAT GAC CAT GCAGGCAT CTT CACT TTT GAAT GT GAT ACTATT C AT GT C AGT GA 1628 

I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I 

I I 

Db 1548 

TCTGGACGACGACCACGCGGGCATCTTCTCCTTCCAGGACCGCCTGCTGCATGTGAGCGA 1607 
Qy 1629 

GAGT ATT GGT GTT AT GGAGGT CAAGGTT CT GCGGAC AT CAGGT GCCC GGGGT ACAGT CAT 1688 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 1608 

GTGCATGGGCACCGTGGATGTGCGTGTGGTGCGCAGCTCTGGCGCACGTGGCACCGTACG 1667 
Qy 1689 

CGT CC C CTTT AGGACAGTAGAAGGGACAGCCAAGGGT GGC GGT GAGGACTTT GAAGACAC 1748 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1668 

CCTTCCCTACCGCACAGTGGACGGCACGGCTCGTGGTGGTGGTGTGCACTACGAGGATGC 1727 
Qy 1749 

AT AT GGGGAGT T GGAATT CAAGAATGAT GAAACT GT GAAAAC C ATAAGGGTTAAAAT AGT 1808 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 1728 

TT GTGGAGAGCT GGAGTT C GGC GAT GAC GAGACCAT GAAGACT CTTCAGGTCAAGAT AGT 1787 
Qy 1809 

AGATGAGGAGGAAT AC GAAAGGCAAGAGAAT TT CTT C ATT GC CCTT GGT GAACC GAAATG 1868 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I 



Db 1788 

GGACGACGAAGAGTATGAGAAGAAGGACAACTT CTTCATCGAGCT GGGCCAGCCCCAGTG 1847 

Qy 1869 GATGGAACGTGGAATAT CAGATGTGAC 

AGACAGGAAGCTGAC 1910 

II I I I I I I I I I I I I I I 
I I I I I I I I I I I I I I I 
Db 1848 

GCT TAAGC GGGGCATCT CAGCT CT GCTACT CAAC CAAGGGGAT GGAGACAGGAAGCT GAC 1907 
Qy 1911 

TAT GGAAGAAGAGGAGGCCAAGAGGATAGCAGAGAT GGGAAAGCCAGT ATT GGGT GAACA 1970 

I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 

I 

Db 1908 

TGCAGAGGAGGAGGAGGCC C AGAGGATAGCAGAGAT GGGCAAGC CAGTT CTT GGGGAGAA 1967 
Qy 1971 

CC C CAAACT AGAAGTC AT C ATT GAAGAGT CCT AT GAGTT CAAGACT ACGGT GGACAAACT 2030 

I II II I I I I I I I I II II I I I I I I I I II MM II I I I I I II 

I I I I I 

Db 1968 

CT GT C GC CT C GAGGT CAT CATCGAGGAGT CTT AT GACTTTAAGAAT ACGGT GGATAAACT 2027 
Qy 2031 

GATCAAGAAGACAAACCTGGCCTTGGTTGTGGGGACCCATTCCTGGAGGGACCAGTTCAT 2090 

II I II II II I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I 

I 

Db 2028 

CAT AAAGAAAACAAAC CTGGCCTTGGT GATT GGGACC CACT CAT GGAGGGAGCAGTT T TT 2087 

Qy 2091 GGAGGCCATCACCGT CAGTGCAGCA 

GGGGAT GAGGAT GAGGAT GAAT C C GGGGAGGA 2147 

I II I I I I I I I I I I I I I I I II I I I I I I II I I I II II I 

I II I I 

Db 2088 

AGAGGCAGT T ACAGTGAGCGCAGGGGAC GAGGAGGAGGAT GAGGAT GGGTCT CGT GAGGA 2147 
Qy 2148 

GAGGCTGCCCTCCTGCTTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCT 2207 



Db 2148 

GCGGCTGCCATCCTGCTTTGACTACGTGATGCACTTCCTGACGGTGTTCTGGAAAGTTCT 2207 
Qy 2208 

GTTTGCCTGTGTGCCCCCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCAT 2267 

III I II II I II I I I I I I I II I I I I I I I II II I I I I II I II I I I I I I 

I I I 

Db 2208 

GTTCGCCTGCCTTCCACCCACGGAGTACTGCCATGGCTGGGCCTGCTTTGGTGTCTGCAT 2267 
Qy 2268 

CCTCATCATTGGCATGCTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCAC 2327 

III I I I II I I II I I I II III I I I I II I I II I I I I II I I I I I I II 

Mill 

Db 2268 

CCTGGTCATTGGTCTGCTCACTGCCCTCATCGGAGACCTGGCCTCACACTTTGGGTGCAC 2327 



Qy 2328 

CATTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCC 2387 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I 

Db 2328 

CGTGGGCCTCAAGGACTCAGTCAACGCCGTGGTCTTCGTGGCTCTGGGCACCTCCATCCC 2387 
Qy 2388 

AGAT AC GTTT GC CAGCAAAGCT GCT GCC CTCCAGGAT GT AT AT GCAGACGCCT CCATT GG 2447 

I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 2388 

TGACACGTTTGCCAGCAAGGTGGCCGCGCTGCAGGACCAGTGCGCCGACGCGTCCATCGG 2447 
Qy 2448 

CAACGTGACGGGCAGCAACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGT 2507 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II 

I I 

Db 2448 

TAACGTGACCGGCTCCAATGCGGTGAACGTGTTCCTGGGCCTGGGTGTGGCCTGGTCGGT 2507 
Qy 2508 

GGCCGCCATCTACTGGGCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGC 2567 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

I I I II 

Db 2508 

GGCCGCAGTGTACTGGGCGGTGCAGGGTCGCCCCTTCGAGGTGCGTACAGGCACGCTGGC 2567 
Qy 2568 

CTTCTCCGTCACCCTCTTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCG 2 62 7 

II I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I 

I I I I I I I I I I I I I I 

Db 2568 

CTTCTCGGTCACACTGTTCACCGTCTTCGCCTTCGTGGGCATCGCAGTGCTCTTGTACCG 2 627 
Qy 2628 

AAGGCGGCCGCACCTGGGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAAC 2687 

I I I I I I III I II II I II I I II I I I I I II II I I I I I I I I I I I 

I I 

Db 2628 

GCGCCGGCCACACATCGGCGGCGAGCTGGGCGGCCCGCGGGGACCCAAGCTAGCCACCAC 2687 
Qy 2688 

ATGGCTCTTTGTGAGCCTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTG 2747 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 2688 

CGCTCTCTTCCTGGGCCTCTGGTTCCTCTACATTCTCTTCGCCAGCCTGGAGGCTTATTG 2747 

Qy 2748 CTACAT CAAGGGGTT CT A 2765 

I I I I I III I I I I I 
Db 2748 CCACATTCGGGGCTTCTA 2765 



RESULT 15 
US-09-901-419-1 

; Sequence 1, Application US/09901419 
; Patent No. US20020069421A1 
; GENERAL INFORMATION: 

; APPLICANT: The Curators of the University of Missouri 



; TITLE OF INVENTION: LARGE SCALE EXPRESSION AND PURIFICATION OF 
RECOMBINANT 

; TITLE OF INVENTION: PROTEINS 
; FILE REFERENCE: UM01531.1 

; CURRENT APPLICATION NUMBER: US/09/901, 419 

; CURRENT FILING DATE: 2001-07-09 

; PRIOR APPLICATION NUMBER: 60/218,125 

; PRIOR FILING DATE: 2000-01-13 

; NUMBER OF SEQ ID NOS: 2 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 1 

LENGTH: 4087 

TYPE: DNA 
; ORGANISM: Bos taurus 

FEATURE : 

NAME/ KEY: CDS 

LOCATION: (268) (3180) 

NAME/KEY: sig_peptide 

LOCATION: (268) (363) 
; NAME/KEY: misc_f eature 

LOCATION: (3178) 

; OTHER INFORMATION: A Poly (H) affinity tag comprising 6 His residues 
; OTHER INFORMATION: have been inserted at the C-Terminus end of the 
; OTHER INFORMATION: coding region of the protein 
US-09-901-419-1 

Query Match 44.4%; Score 1227.8; DB 9; Length 4087; 

Best Local Similarity 67.1%; Pred. No. 0; 

Matches 1935; Conservative 0; Mismatches 777; Indels 171; 
Gaps 7; 

Qy 46 

TTTGGGCTGGTTACCTTTGTGCTCTTCCTGAATGGTCTTCGAGCAGAGGCTGGTGGCTCA 105 

III I I I I I I I I I I I I I I II I I III 

Db 307 

TTT C AC GT GAT AGC CATGGT GGCT CT CT T GTT TT CC C AT GT GGACCAT AT AAGT GCT GAG 366 
Qy 106 

GGGGAC GT GC CAAGCACAGGGCAGAACAATGAGT CCT GTT CAGGGT CAT C GGACT GCAAG 165 

II II II III I I I I I I I I I I I I I I I I I I 

I I I 

Db 367 

ACAGAAAT GGAAGGAGAAGGCAAC GAGACT GGC GAGT GT ACT GGCT CCT ATT ACT GTAAG 426 
Qy 166 

GAGGGTGTCATCCTGCCAATCTGGTACCCGGAGAACCCTTCCCTTGGGGACAAGATTGCC 225 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 427 

AAGGGGGTGATTTTACCCATTTGGGAGCCCCAGGACCCTTCCTTTGGAGACAAAATTGCT 486 
Qy 226 

AGGGTCATTGTCTATTTTGTGGCCCTGATATACATGTTCCTTGGGGTGTCCATCATTGCT 285 
III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I I 

Db 487 

AGAGCGACTGT GTATTTT GT GGCCAT GGT CTACAT GTTT CTT GGAGTCT CAATCATTGCT 546 



Qy 286 

GAC C GCTT CAT GGC AT CT ATT GAAGT CAT CAC CT CT CAAGAGAGGGAGGT GACAATT AAG 345 



1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 1 

1 1 1 

Db 547 

GACC GGTT C AT GT C CT CT AT AGAAGT CAT CACGT CT CAAGAGAAAGAAAT CAC CAT AAAG 606 
Qy 346 

AAACC CAAT GGAGAAACC AGCACAAC C ACT ATT CGGGT CT GGAAT GAAACT GT CT C CAAC 405 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 607 

AAACC CAAT GGAGAGACCAC CAAGACAACT GT GAGGAT CT GGAAT GAGACAGT GT C CAAC 666 



Qy 406 

CTGACCCTTATGGCCCTGGGTTCCTCTGCTCCTGAGATACTCCTCTCTTTAATTGAGGTG 465 

I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I II I I I I 

MINI 

Db 667 

CT GAC CT TGAT GGC CCT GGGGTCT T C AGCT CCAGAGATT CT C CTTT CAGTAAT C GAGGT G 726 



Qy 466 

TGT GGT CAT GGGTT CATT GCT GGT GAT CTGGGAC CTT CT ACCAT T GT AGGGAGTGCAGCC 525 

I I I I I III I I I I III II II II II III I I I I I II I I I I I I I I 

I I 

Db 727 

TGTGGCCATAACTTCACTGCAGGAGACCTTGGCCCTAGCACCATCGTGGGGAGTGCTGCA 786 



Qy 526 

T T CAAC AT GT T CAT CAT CATT GGC AT CT GT GT CT ACGT GAT C CC AGAC GGAGAGACTCGC 585 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I II II Mill 

I 

Db 787 

TTCAACATGTTCATCATCATTGCCCTTTGTGTGTATGTCGTCCCGGATGGGGAGACAAGG 846 



Qy 586 

AAGAT CAAGCAT CTACGAGTCTT CTT CATCACCGCTGCTTGGAGTATCTTT GCCTACATC 645 

I I II I I I II I I I I I II II I I I I I I II II II I I I II I I I I I I I I I I I 

I 

Db 847 

AAGAT CAAGCAT CT GC GT GT GTT CT TT GTGACAGCAGC AT GGAGCAT CTTT GCCT AT ACC 906 



Qy 646 

TGGCTCTATATGATTCTGGCAGTCTTCTCCCCTGGTGTGGTCCAGGTTTGGGAAGGCCTC 705 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

I 

Db 907 

TGGCTTTACATCATTTTGTCTGTCAGCTCCCCTGGGGTCGTGGAGGTCTGGGAAGGTTTG 966 



Qy 706 

CTCACTCTCTTCTTCTTTCCAGTGTGTGTCCTTCTGGCCTGGGTGGCAGATAAACGACTG 765 

I I I I I I I I I I I II I I I I Mill I I I I I II II I I I I I I I I 

I I 

Db 967 

CTTACTTTCTTCTTCTTCCCCATCTGCGTTGTGTTTGCTTGGGTGGCAGACAGGAGGCTT 1026 



Qy 766 

CT CTTCTACAAATACATGCACAAAAAGTACCGCACAGACAAACACCGAGGAATTAT CAT A 825 

I I I I I I I I I I I I I II I I I I I I I I I Ml II I I M I I I I 

I I 

Db 1027 

CT GT T TTACAAGT ATGT CT ACAAGAGGT AT CGGGCTGGCAAGCAGAGGGGAAT GATTATT 1086 



Qy 826 GAGACAGAGGGT GACCAC CC 

TAAGGGCATTGAGATGGATGGGAAAATGATG 876 

II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

I 

Db 1087 

GAAC AC GAAGGAGACAGGCCAT CTT CCAAGACAGAAATT GAAAT GGAT GGGAAAGT GGT C 1146 



Qy 877 AATTCCCATTTTC 

TAGATGGGAACCTGGTGCCCCTGGAAGGGAAG 921 

I I I I I I I I I I I I I I I I I I I I I I I I I II I II 

Db 1147 

AAT T CCCAT GTT GACAGTTT CTT AGATGGAGC C CT GGTT CTGGAGGT T GAT GAGAGGGAC 1206 



Qy 922 — 

GAAGTGGAT GAGT C CCGCAGAGAGAT GAT CCGGATT CT CAAGGAT CT GAAGCAAAAA 978 

III I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I 

Db 1207 

CAAGAT GAT GAAGAAGC C AGGCGAGAAAT GGCT AGGATT CTGAAGGAACT CAAGCAGAAG 1266 



Qy 979 

CAC C CAGAGAAGGACTT AGAT C AGCT GGT GGAGAT GGCCAATT ACT AT GCT CTTTC C CAC 1038 

I I I I I I I I I I I I I I I I I I I I I I I MINIMUM I 

I I 

Db 1267 

CAT C CAGAGAAGGAAAT AGAGCAATTAAT AGAATT AGCCAATT ACCAAGT CTTAAGT CAG 1326 



Qy 1039 

CAAC AGAAGAGC C GC GCCTT CT AC C GT AT CCAAGC C ACT CGT AT GAT GACT GGT GC AGGC 1098 

II II II II II II II I I I I I I I I I I I I I II II 1111111 II 

I II I I I 

Db 1327 

CAGCAAAAAAGT C GAGC GTTTT AC C GT AT T CAAGCT ACC C GCCTGAT GAC C GGAGCAGGC 1386 



Qy 1099 

AAT ATCCTGAAGAAACATGCAGCAGAACAAGCCAAGAAGGCCT CCAGCATGAGCGAGGT G 1158 
I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I 

I I I I I 

Db 1387 

AACATTTTAAAGAGGC AT GCAGCAGACCAAGC C AGGAAAGCT GT CAGCAT GCAT GAGGT C 1446 



Qy 1159 CACACC GAT GAGCCT G 

AGGACT TT AT TT C CAAGGT CT T CT T T GAC CCAT GTT CTT AC 1215 

I I I I I I I II I I I I I I II I I I I I I I I I I I I II M I 

I I 

Db 1447 

AACACGGAAGT GGCTGAAAATGACCCTGTCAGTAAGATCTTCTTT GAACAAGGGACATAT 1506 
Qy 1216 

CAGT GCCTGGAGAACTGTGGGGCTGTACTCCT GACAGT GGTGAGGAAAGGGGGAGACAT G 1275 

I I I I II I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I II 

I I 

Db 1507 

CAGT GT CT GGAGAACTGT GGCAC AGT AGCCCT GACCATT AT C CGCAGAGGT GGT GATT T G 1566 
Qy 1276 

TCAAAGACCAT GTAT GTGGACTACAAAACAGAGGAT GGTT CTGCCAAT GCAGGGGCT GAC 1335 

I I I I I I I I I I II I I II II I I I I I I I I I I I I I I I I I I I I I I 

MM 



Db 1567 

ACCAACACTGTGTTT GTTGACTTCAGAACAGAGGATGGCACAGCCAATGCTGGAT CTGAT 1626 



Qy 1336 

TATGAGTTCACAGAGGGCACGGTGGTT CTGAAGCCAGGAGAGACCCAGAAGGAGTTCT CC 1395 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II 

Db 1627 

TAC GAATT T AC C GAAGGAACT GT GGT CT TTAAGCCT GGT GAGACCCAGAAGGAAAT C AGA 1686 



Qy 1396 

GT GGGC ATAAT T GAT GAC GACATT TT T GAGGAGGAT GAACACTT CTTT GTAAGGTT GAGC 1455 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml I I I I 

I I I 

Db 1687 

GTT GGC AT CAT T GAT GAT GACAT CTTT GAGGAGGAT GAGAATTT C CTT GT GCAT CT CAGC 1746 



Qy 1456 

AAT GT C C GCAT AGAGGAGGAGCAGCCAGAGGAGGGGATGCCT CCAGCAAT ATT CAACAGT 1515 

Mill II III I I I I I I I I I I Ml 

MM 

Db 1747 AACGT CAAAGTATCTTTGGAAGCCTCGGAAGACGGCATCCT GGAAGC 

CAGT 1797 



Qy 1516 

CTTCCCTTGCCTCGGGCTGTCCTAGCCTCCCCTTGTGTGGCCACAGTTACCATCTTGGAT 1575 

II II II Ml MM I II I I I II I I I II I I I I I I 

I I I 

Db 1798 

CATGTCTCTACCCTTGCTTGCCTGGGATCCCCCTCCACTGCCACCGTGACTATTTTTGAT 1857 



Qy 1576 

GAT GAC CAT GCAGGCATCTT CACTTTT GAAT GT GAT ACT ATT CAT GT CAGTGAGAGTATT 1635 

II I II I II II I I I I I II I I I I I II I II I II I II I II I II I I I 

I I I 

Db 1858 

GAT GAC CAT GCT GGCATCTTT ACTTT T GAGGAACCGGTGACT CAT GT GAGTGAGAGCATT 1917 



Qy 1636 

GGTGTTATGGAGGTCAAGGTTCTGCGGACATCAGGTGCCCGGGGTACAGTCATCGTCCCC 1695 

II I M I I I I I I I I II I M I I II I I I II II II II I II II I I I 

I I I 

Db 1918 

GGC AT CAT GGAGGT GAAAGT T CT GAGAACAT CT GGAGCAC GT GGAAAT GTTAT CGTT CCC 1977 



Qy 1696 

TTT AGGACAGT AGAAGGGAC AGCCAAGGGTGGC GGT GAGGACTTT GAAGACACAT AT GGG 1755 

I II III I II I II II MM I I I I I II I I I I II I I I II I I II I II 

I I 

Db 1978 

T ATAAGAC CATT GAGGGGACCGCC AGAGGTGGAGGGGAGGACTT T GAGGACACAT GC GGA 2037 



Qy 1756 

GAGTT GGAAT T CAAGAAT GAT GAAACT GT GAAAAC CATAAGGGTTAAAAT AGT AGAT GAG 1815 

II I I I I I I I II II I II II II II I II I I I I I I II I I III 

II II I 

Db 2038 

GAGCT C GAGTT CCAGAAT GACGAAATT GT CAAAACAATAT CAGT CAAGGTAATT GAT GAT 2097 



Qy 1816 

GAGGAAT ACGAAAGGCAAGAGAAT TT CTT CAT T GC C CTT GGT GAACCGAAAT GGAT GG — 1873 



1 1 1 1 1 1 1 1 1 I I III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 

Db 2098 

GAGGAGT AT GAGAAAAACAAGAC CTT CTTCCT T GAGATT GGAGAGC CC CGCCT GGT GGAG 2157 

Qy 1874 " 

AACGTGGAA 1882 



Db 2158 

AT GAGT GAGAAGAAAGCCCT GTT AT T GAAT GAGCTT GGT GGCT TCACAAT AACAGGGAAA 2217 
Qy 1883 

TAT C AG AT GT G AC AG AC AG 1901 

I I I I I I I I I I 

Db 2218 

TACCTGTATGGCCAGCCTGT CTT CAGGAAAGTTCAT GCTAGAGAACATCCACTCCCCT CT 2277 

Qy 1902 

GAAGCT GACT AT GGAAGAAGAG 1923 

I I I I II I I I I I 

I I I 

Db 2278 

ACT ATAAT CACCAT C GCAGAT GAAT AT GATGACAAGCAGC C ACTGAC CAGCAAAGAGGAG 2337 
Qy 1924 

GAGGCCAAGAGGAT AGCAGAGAT GGGAAAGC C AGT ATTGGGT GAACACC C CAAACT AGAA 1983 

III I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I 

I I 

Db 2338 

GAAGAGAGGC GCATT GCGGAAAT GGGGCGCC C CATT CT GGGAGAGCACAC C AGACT GGAG 2397 
Qy 1984 

GT CATCATT GAAGAGTCCTATGAGTT CAAGACTACGGTGGACAAACTGATCAAGAAGACA 2043 

II I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I 

I I I I I I I I I 

Db 2398 

GT GAT CATT GAAGAAT C CT AC GAGT T CAAGAGTACCGT GGACAAACT GATTAAGAAGACA 2457 
Qy 2044 

AACCT GGCCTT GGT T GT GGGGACCCATT C CT GGAGGGACCAGTTCAT GGAGGCCAT CACC 2103 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I 

Db 2458 

AACCT AGCCCT CGT GGTT GGGACGAACAGCT GGAGAGAGCAGTTCAT CGAGGC GAT CACT 2517 
Qy 2104 

GTCAGTGCAGCAGGGGAT GAGGAT GAGGATGAATCCGGGGAGGAGAGGCT GCCCTCCTGC 2163 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I I I I I I I 
Db 2518 

GT C AGT GCT GGGGAAGAT GAC GAT GACGACGAAT GT GGGGAGGAGAAGCT GCC CT C CT GT 2577 
Qy 2164 

TTTGACTACGTCATGCACTTCCTGACTGTCTTCTGGAAGGTGCTGTTTGCCTGTGTGCCC 2223 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I III 

I I I 

Db 2578 

TTTGACTACGTGATGCACTTTCTGACTGTGTTCTGGAAGGTCCTCTTCGCCTTTGTCCCC 2637 



Qy 2224 



CCCACAGAGTACTGCCACGGCTGGGCCTGCTTCGCCGTCTCCATCCTCATCATTGGCATG 2283 

II I I I I I I I I I I I I I I I I I I I I I II III I I I I I I I I II I I I I I II III 

I 

Db 2638 

C C GAC AGAGT ACT GGAACGGCT GGGC GT GTT T C AT CGT CT CCAT C CT CAT GATCGGC CT A 2697 
Qy 2284 

CTCACCGCCATCATTGGGGACCTGGCCTCGCACTTCGGCTGCACCATTGGTCTCAAAGAT 2343 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 2698 

CT GACGGCT TTC ATT GGAGAC CT C GCTT CCCACT T CGCCT GCACC AT C GC CCT GAAGGAT 2757 
Qy 2344 

TCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAGATACGTTTGCCAGC 2403 

II II II II II II I I I I I II I I I I Mill II I I I I I II I I I I I 

III 

Db 2758 

TCCGTGACCGCGGTGGTGTTCGTTGCGCTTGGAACCTCAGTGCCAGACACATTTGCAAGC 2817 
Qy 2404 

AAAGCT GCT GC CCTC CAGGAT GT AT AT GCAGACGC CT CCATT GGCAAC GT GACGGGCAGC 2463 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

I I I I I I 

Db 2818 

AAAGTGGCCGCCAC C CAGGACCAGT AT GCGGAT GCAT CCATAGGTAAC GT CACAGGCAGC 2877 
Qy 2464 

AACGCCGTCAATGTCTTCCTGGGCATCGGCCTGGCCTGGTCCGTGGCCGCCATCTACTGG 2523 

I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2878 

AACGCGGTGAACGTCTTCCTGGGCATCGGTGTGGCCTGGTCCATCGCCGCCATCTACCAC 2937 
Qy 2524 

GCTCTGCAGGGACAGGAGTTCCACGTGTCGGCCGGCACACTGGCCTTCTCCGTCACCCTC 2583 

II III I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

I I I 

Db 2938 

GCGGCCAACGGGGAACAGTTCAAAGTGTCCCCTGGCACGCTAGCTTTTTCTGTCACTCTC 2997 
Qy 2584 

TTCACCATCTTTGCATTTGTCTGCATCAGCGTGCTCTTGTACCGAAGGCGGCCGCACCTG 2643 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

I 

Db 2998 

TTCACCATTTTTGCTTTCATCAATGTGGGGGTGCTGCTGTATCGGCGGAGGCCAGAAATT 3057 
Qy 2644 

GGAGGGGAGCTTGGTGGCCCCCGTGGCTGCAAGCTCGCCACAACATGGCTCTTTGTGAGC 2703 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

I 

Db 3058 

GGAGGTGAGCTGGGTGGGCCCCGGACTGCCAAGCTCCTCACATCCTGCCTCTTTGTGCTC 3117 
Qy 2704 

CTGTGGCTCCTCTACATACTCTTTGCCACACTAGAGGCCTATTGCTACATCAAGGGGTTC 2763 

I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I 

I I I 

Db 3118 

CTGTGGCTCTTGTACATTTTCTTCTCCTCCCTGGAGGCCTACTGCCACATAAAAGGCTTC 3177 



Qy 2764 TAA 2766 

I I I 

Db 3178 TAA 3180 
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